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WORD CHANGES INTRODUCED AD LIBITUM IN FIVE 
SPEECHES BY FRANKLIN DELANO ROOSEVELT* 


LAURA CROWELL 


University of Washington 


RANKLIN ROOSEVELT worked 
F aitigently on his speeches. That he 
regularly provided the basic thoughts 
which he wanted incorported in an ad- 
dress—indeed, often dictated a few pages 
of his own thinking as a starting-point 
—and then .worked again and again 
with his speech assistants to bring: the 
manuscript to the precise length, con- 
tent and tone he desired is now widely 
understood, Both the anticipated im- 
mediate audience and the thousands of 
unseen radio listeners were considered 
in this exhilarating, exhausting process 
of readying the final draft; Roosevelt 
also often read the developing para- 
graphs aloud to test their oral quality. 

In Roosevelt's have 
particularly apt material for a study of 
verbal changes made during the act of 
speaking. We are able to know what he 
planned to say, for the Reading Copies 
of his major addresses are available for 
study at the Hyde Park Library. We 
are able to know also what he actually 


addresses we 


said, for the electrical transcriptions of 
his speeches, which reveal his spontan- 
eous alterations, are available from the 
National Archives. 
One would not expect to discover any 
large number of misreadings because 
* The electrical transcriptions were obtained 


through funds from the Agnes Anderson Fund, 
University of Washington. 


this was a presentation of well-known 
material by a speaker who took pride 
in reading well. Furthermore, no one 
knew better than did Roosevelt that if 
any spontaneous alteration were to re- 
sult in an error of fact, a shallowness of 
inference, or an indication of stigma or 
unintentional sarcasm, he would have 
put a weapon in the hands of ready and 
gleeful attackers of him and his policies 
and often, indeed, of his country as 
well. Another factor which might have 
held back his deliberate ad-libs was the 
urging of his speech assistants that he 
stick to the text which 
painstakingly prepared.+ 


had been so 


Yet during the presentation of even 
major addresses Roosevelt did make 
changes. A careful examination of his 
delivery of five selected addresses to 
large immediate audiences and to the 
country by radio will reveal the nature 
and extent of his ad-libbing and provide 
a basis for speculation as to his reasons 
for departing from the words of the 
manuscripts. 

What was the nature of the audience 
before whom these addresses were given, 


1See James M. Burns, Roosevelt: The Lion 
and the Fox (New York: Harcourt, Brace, 
1956), 421; Samuel I Rosenman, Working 
With Roosevelt (New York: Harper & Bros., 
1952), p. 486; Robert E. Sherwood, Roosevelt 
and Hopkins (New York: Harper & Bros., 
1950), p. 218. 
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the throngs of listeners whose thun- 
derous presence and response were a 
counterstimulation to the President? 
The millions beside their radios were 
not a part of this vigorous interaction; 
adaptation to their anticipated attitudes 
had been woven the fabric and 
fashioning of the addresses by FDR and 
the speech assistants, but adaptation to 
the thousands before him was a con- 
tinuing matter and, therefore, the focus 
of his efforts in spontaneous alteration. 


into 


Some 60,000 people jammed into the 
National League Baseball field in Pitts- 
burgh for the October 1, 1936, cam- 
paign address; ardent Democrats, noisily 
celebrating their success of the previous 
November heard the Victory Dinner 
Address in Washington, D. C., 
March 4, 1937; a huge partisan crowd 
thundered applause at his rhythmic jibe 
at “Martin, Barton and Fish” in the ex- 
citement-rouser at Madison Square Gar- 
den, October 28, 1940; the Houses of 
Congress, assembled for the annual pres- 
idential message to Congress, heard 
Roosevelt explain the Lend-Lease Plan 
and set forth the Four Freedoms on Jan- 
uary 6, 
one thousand strong, in a crowded ban- 
quet hall in the Statler Hotel delighted 
in his masterful telling of the Fala in- 
cident in the Teamsters’ Union Address 
in Washington, D. C. September 23, 
1944.2 These, then, were the situations: 
three campaign rallies, a political din- 
ner meeting, an assembled Congress. It 
was in oral presentation before these 
groups and in adjustment to their re- 
sponses that FDR made the ad-libs to 
be studied here. 


on 


1941; enthusiastic supporters, 


By ad-libbing in these addresses we 
mean _ substitutions, 
tions, deletions, transpositions from the 


additions, inser- 


2 Rosenman, who worked for 17 years with 
FDR on his speeches, considered this address 
the best political campaign speech he ever 
made. Op. cit., p. 248. 
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copy he had in hand. Taking rough ac- 
count of the varied lengths of the 
speeches and using the Pittsburgh ad- 
dress with its 172 instances of sponta- 
neous alteration as a basis, we find that 
the other two campaign addresses had 
nearly as many. On the other hand, the 
Victory Dinner address to a friendly au- 
dience but on the hazardous Supreme 
Court issue had only one-half as many, 
and the Four Freedoms address on a 
safer theme but to the more formal Con- 
gressional audience had only one-third 
as many.* Thus, of these five addresses, 
the campaign speeches seemed to allow 
him greater freedom in ad-libbing than 
did the other occasions. 

Detailed these ad-libs 
shows them to fall into two major class- 


analysis of 


es. Roosevelt appears to have continued 
under the impact of the speaking situa- 
tion itself the work which he and his 
speech assistants had pursued so as- 
siduously in their labors on the drafts, 
the result being merely a further tailor- 
ing of the text. But in the oral presenta- 
tion before the massed listeners and in 
the very act of communication other 
changes occurred, changes directly as- 
sociated with this immediate speaker- 
audience relationship; these alterations 
form a second type of ad-libs. 


FURTHER TAILORING Ap-LIBs 


In an earlier study of the Four Free- 
doms address it was discovered that the 
alterations made during the working of 
the seven drafts arranged themselves in 
five categories: relationships, concrete- 
ness, force, motivation, and composi- 
tion.* It seems logical that if we now 

8 These comparisons indicate Roosevelt's 
amount of ad-libbing in only the most general 
way, of course, since they refer to the number 
of times he deviated from the text in any wa! 
and not to the nature or extent of the in 
troduced or omitted material. 

4 Laura Crowell, “The Building of the ‘Fou: 


Freedoms’ Address,” Speech Monographs, XXII 
(November 1955), 266-283. 
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AD-LIBBING IN FIVE SPEECHES BY FDR 


find ad-libs falling into these same cat- 
egories we may conclude that FDR in 
the very act of presentation was truly 
continuing the work which had been 
halted by the necessity of binding the 
Reading Copy (now typed triple-space 
on thin, non-rustling sheets) in the limp 
black folder ready for use on the plat- 
form. 


Relationships 


Roosevelt’s desire to make his state- 
ments inclusive, to avoid neglect of any 
element, continued within the actual 
presentation of the speeches: in declar- 
ing that his Administration had planned 
to protect “the small business, the small 
corporation, the small shop” from de- 
flation, he made the impromptu addi- 
tion, “and the small individual” (Pitts- 
13);° in speaking of the faith of Amer- 
icans in “political and economic democ- 
racy,” he inserted “and social” (Vic-3); 
in asserting that without repeal of the 
embargo no help would have been af- 
forded Britain, he expanded by adding 
“and a lot of other nations” (MSG-55). 
It seems that his drive to be inclusive 
remained vigorously at work while he 
was actually addressing the audience. 

Despite FDR’s liking for specificity, 
his relational, movement sense was per- 
haps even stronger; for example, in re- 
ferring to an opponent's speech, he al- 
tered his text from “in June, 1938” to 
“a little over two years ago” (MSG-22). 
Furthermore, what was implicit through 
the sentence structure provided in the 
Reading Copy FDR sometimes made ex- 
plicit in his oral presentation by adding 

5 Throughout the manuscript reference to 
the speeches will be made as follows: Pitis for 
the Pittsburgh address; Vic for the Victory Din- 
ner address; MSG for the Madison Square 
Garden speech; FF for the Four Freedoms ad- 
dress; TU for the Teamsters’ Union address. 
The numbers will designate the paragraphs as 
presented in The Public Papers and Addresses 


of Franklin D. Roosevelt, compiled by Samuel 
I. Rosenman. 
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the relation-giving word spontaneously; 
he ad-libbed to express relationships in 
these sentences: “I rejected that advice 
because nature was in an angry mood” 
(Pitts-11);° “efforts of this Administra- 
tion to arm itself adequately for pur- 
poses of defense” (MSG-10); “when gov- 
ernment finance or any other kind of 
finance is honest” (Pitts-2). He also 
verbalized relationships for which words 
were less clearly suggested by the Read- 
ing Copy; for instance, speaking of the 
cooperation which able men of other 
politics were giving the Administration, 
FDR then ad-libbed as follows: “And, 
the other way ’round, this Government 
is cooperating with them—one hundred 
per cent” (MSG-12). 


These motivations to clarify relation- 
ships, so strongly felt by FDR and at 
work all the time during the prepara- 
tion of the manuscript, seem to have 
continued their influence while he was 
in the act of speaking. 


Concreteness 


In spite of the care taken at every 
stage of preparation of the drafts to 
put ideas in specific terms, FDR never- 
theless in a few instances added details 
while speaking. These details were 
sometimes geographical, as the addition 
of Louisville and Memphis to his state- 
ment about “men piling sandbags on 
the levees at Cairo” (Vic-54), and of the 
Connecticut River when speaking of 
the extension of flood control projects 
(Vic-51). Sometimes the details added 
were even numerical, as when the script 
referred to the amount of American 
money sent abroad in a single decade as 
“not merely eight billions but many 


6 Where note is to be taken within quota- 
tions from the addresses of alterations made 
in the act of speaking, all changes except omis- 
sions will be italicized; omitted elements will 
be indicated by parentheses. For example, here 
the word because was ad-libbed. 
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more billions’ and FDR added “about 
fourteen billions’ (Pitts-28). 

Again, the details added spelled out a 
purpose more specifically; for example, 
when speaking of “landing vessels’ as 
one of the types of war equipment that 
American industry had produced, he 
inserted the explanation, “to get the 
troops ashore” (TU-22). He brought 
abstractions down to earth with concrete 
elements: after the phrases “to restore 
purchasing power, to raise values” he 
added “to put people back to work” 
(Pitts-19). 

Thus, FDR continued to alter his 
speeches during their presentation in 
the direction of greater concreteness. 
Force 


Despite the concerted 


attempt by 
Roosevelt and his speech writers to de- 
velop the strongest verbal and _ syntac- 
tical instrument possible he yet found 
occasion in these addresses to add words 
to gain strength for the vocalization of 
the ideas. At one point he added the 
idea of desirability to that of capability 
when he inserted a vigorous clause: 
“Since we could not get the money by 
taxes and ought not to have got it by 
taxes in those years we borrowed it” 
(Pitts-23). 

When the words in the text did not 
seem capable of carrying the vocal 
weight of the idea he was trying to ex- 
press, he ad-libbed to provide himself 
a more suitable verbal vehicle. Some- 
times the ad-libs merely asserted the 
strength of the idea: “The Republican 
leaders were definitely and beyond per- 
adventure of doubt trying to block our 
efforts toward national defense’ (MSG- 
19). At times words were added to per- 
mit the amplification of an idea, to pro- 
vide for an expansiveness he was feel- 
ing as he spoke, as in these examples: 
“progress that’s being made in the whole 
picture of our national defense” (MSG- 


14); “production of all the types of ma- 
chines and tools and steel and every. 
thing else that have made this nation 
the industrial 
(MSG-11). addition of 
the time element was used to fulfill this 


leader of the world” 


Elsewhere the 


need for amplification: “That deter- 
mination of ours, extending over all 
these years, was proved. .. .” (FF-5). 


In another instance, the word “roar” 
even when preceded by a highly signifi- 
cant pause did not seem to offer pho- 
netic scope for the size of his idea, and 
FDR held attention on its significance 
by pointing at it verbally: “This new 
roar—for that’s the best term—is the 
best evidence in the world” (Vic-9). And 
once he spelled out a key word as well 
as repeated it to build force: “They were 
botched b-o-t-c-h-e-d they were botched 
by a Republican Administration” (TU- 
43). At one point he introduced the 
word “why” as an expletive, as though 
verbalizing thereby a scope for his idea 
not sufficiently allowed by the Reading 
““Ton’t leave the task of 
making the peace to those old men... 
why just turn it all over to us’” (TU-15). 


Copy words: 


FDR often supplied additional words 
at the outset of sentence or clause to 
serve a general leading-in, introductory 
purpose; the most commonly added 
word at the sentence opening was “and.” 
In these examples the addition of the 
word “and” provided a malleable open- 
ing syllable and allowed the building 
up of force for the important word that 
followed: “And they are playing politics 
with the national security of America 
today” (MSG-30); “And tonight I’m go- 
ing to talk to you about the boxscore 
of the government of the United States” 
(Pitts-1). But this introductory function 
was also often served by more repetition 
of the opening words of the sentence; 
these examples show Roosevelt winding 


up for more vigorous utterance: “The 
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this 
great Nation, solvent, sound in credit, 
..” (Pitts-42); “The 
first thing the first thing they did was to 
try to eliminate the battleships” (MSG- 
42). 


Government the Government of 


is coming through . 


At the beginning of clauses within 
sentences the “that” often 
added to play this introductory role, as 
: still 
think that you ought to be good boys 
and stay out of politics” (TU-24). Again 


word was 


in this example: “candidates 


the repetition of modifiers or preposi- 
tions in a series provided this opening 
element: “when the and the 
values and the transactions of the coun- 
(Pitts-7); 
“it is the same record of timidity, of 
weakness and of shortsightedness” (MSG- 
64). 


incomes 


try are on the down-grade’”’ 


Often in presenting a series orally 
FDR reached a step farther back in the 
structure of the sentence to project the 
parallel elements more vigorously; for 
example, “You and I and everybody else 
saw millions out of work, saw the busi- 
ness concerns running in the red, saw 
the banks closing” (Pitts-5); and “It is 
necessary to mobilize resources, to mo- 
bilize minds and skills” (MSG-g4). 


Just as FDR ad-libbed to provide 
words to serve a building-up function, 
so also he added elements to round out 
the vocal emphasis which he had built 
up for ideas; these words seem added 
as a sort of retiring, subsiding action: 
“In fact, it looked so much like a shut- 
out for the team that you voted a change 
in management” (Pitts-3); “they keep 
this business up usually for three years 
and six months in a row” (TU-3); “And 





from 
so long as it 
democracy work 


continues to make modern 


if I have aught to say about it, it will 
continue” (Vic-7); “Modern science .. . 
knows how to produce as a by-product 
the blessing of cheaper electric power 
for the Nation” (Vic-50). 


Not only did FDR add elements to 
put vocal emphasis where he, as he 
spoke, wished to put it, but he elim- 
inated syllables that might have de- 
feated his intended stress: ““To balance 
our budget in 1933 or (19)34 or (19)35 
would have been a crime against the 
American people” (Pitts-16). Further- 
more, FDR at times enlarged an idea 
sufficiently by and 
pause alone, thus making unnecessary 
words which might otherwise have car- 
ried the emphasis: “The people (them- 
selves) understood that in 1933” (Pitts- 
22); “Listen to this (statement) for in- 
stance: (I quote)” (MSG-21). Elsewhere 


vocal intonation 


he seemed to have spent his force be- 
fore completing the Reading Copy sen- 
tence and thus omitted the tapering-off 
modification: “we move forward with 
God's help to the greatest epoch of free 
achievement by free men that the world 
has ever known (or imagined possible) 
(TU-56). 

Thus, in the act of speaking, Roosevelt 
added or deleted elements as he strove 
for a verbal instrument capable of ex- 
pressing the ideas with the strength and 


scope which he desired. 


Motivation 


During presentation of the addresses, 
Roosevelt heightened the affective pow- 
er of his ideas by several forms of verbal 
change. Sometimes the increase came 
a word or 


from the substitution of 


phrase of stronger connotation: 


to 
so long as it continues to make a modern 
democracy succeed (Vic-6) 
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He invigorated a sentence, changing 
from 


Please God, may that spirit never prevail in our 
land 


He roughened a final admonition 
from 
foreign boondoggling, if you will 


Sometimes he strengthened feeling by 
spelling out commonly desired goals, 
as when he followed the Reading Copy 
pronouncement that “there has been 
no entanglement” with the prediction 
and promise for the future, “and there 
will be no entanglement” (MSG-75). 

FDR further introduced verbal asides 
which evoked or permitted overt expres- 
sion of the listeners’ feelings. On the 
one hand, he animated them by voicing 
a matter of common rejoicing as, for 
example, in the successful prosecution 
of the war: “And I may say that those 
Commanding Generals are making good 
in a big way” (TU-18). On the other, he 
aroused them by citing a matter of com- 
mon derogation: shouts and a cry of 
“You tell ’em, Frankie” followed his 
punch-like line regarding taxes, “To 
pile (on) vast new taxes would get us 
nowhere because values were going 
down, and that makes sense too” (Pitts- 
9); he lifted the booing and laughter to 
further heights by repeating with great 
sarcasm of tone the element they had 
enjoyed, “Senator Taft—the runner-up 
the runner-up” (MSG-25); he followed a 
jibe which had brought applause and 
shouts with an ad-libbed element of 
great belligerence, “for I am not through 
yet” (Vic-45), and drew more laughing 
and whistling. The well-known Fala 
story was further tuned to the au- 
dience’s wild appreciation by ad-libs 
throughout: 


from 
These Republican leaders have not been con- 
tent with attacks upon me, or my wife or my 
sons—they now include my little dog, Fala. 





to 
Pray God that, having discovered it over here, 
we won't forget it either (MSG-67). 


to a savage 
foreign boondoggling, if you like it (Pitts-28), 


Often Roosevelt augmented the sense 
of immediate involvement by his spon- 
taneous changes. Sometimes the ad-lib 
belabored an idea to bring it more dy- 
namically to the consideration of the 
listeners, as in the matter of time: “But 
last fall—this fall, rogo, they had their 
chance” (MSG-49); Administra- 
tion—this one—today—is responsible” 
(TU-32). Sometimes the ad-lib returned 
an idea discussed earlier to their atten- 
tion sharply: “We must set targets and 
objectives for the future which will 
seem impossible—like the airplanes—to 
those who live in. . 


“this 


the dead past” 
(TU-53). Elsewhere he made direct re- 
quests for their consideration of an idea: 
by ad-libbing complete sentences, “Re- 
member, a number of years ago, there 
was a book Mein Kampf written by Hit- 
ler himself” (TU-31), and “Well, let's 
take some simple illustrations that come 
to mind” (TU-32); by inserting ele- 
ments, as “look at their names” (TU-37), 
or rephrasing ideas more vigorously, as 
in the substitution of ‘““Turn another 
page” for “one more example” (MSG- 
49). At times he asserted their real in- 
volvement by such inserted elements as 
these: “We knew and you knew in 
March, 1933, that it would cost money” 
(Pitts-21), and “Something had to be 
done, we all know that” (Pitts-11). 

At many points FDR ad-libbed ele- 
ments which served to informalize the 
message still further. At times this was 





to 
These Republican leaders have not been con- 
tent with attacks on me, or on my wife, or on 
my sons—no, not content with that—they now 
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Unlike the members of my family, he resents 
this. Being a Scottie, as soon as he learned 
that the Republican fiction-writers had con- 
cocted a story that I had left him behind on 
an Aleutian Island and had sent a destroyer 
back to find him—at a cost to the taxpayers 
of two or three or twenty million dollars—his 
Scotch soul was furious. He has not been the 
same dog since. 
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include my little dog, Fala. Well, of course, } 
don’t resent attacks, and my family don’t re- 
sent attacks, but Fala does resent them. You 
know—you know, Fala is Scotch and being a 
Scottie, as soon as he learned that the Re- 
publican fiction-writers in Congress and out 
had concocted a story that I'd left him behind 
on an Aleutian Island and had sent a destroyer 
back to find him—at a cost to the taxpayers of 
two or three or eight or twenty million dol- 
lars—his Scotch soul was furious. He has not 
been the same dog since. (TU-38) 





done by yielding the audience con- 
fidence, as in “‘oh, I can’t understand it, 
I don’t believe you can either” (TU-22), 
and in “my, it seems a long time ago” 
(TU-13). Elsewhere he brought the me- 
chanics of speech-making forthrightly 
into view, as when he ad-libbed ““To use 
the old example that’s always good” 
(MSG-g9), or called informally upon 
authorities, “You see, I’m talking by the 
book” (MSG-44), or used a prominent 
name familiarly, “And I think it was a 
kinsman of mine, about go years ago, 
who invented the word, ‘weasel words’ ” 
(MSG-4). Often he obtained increased 
informality by ad-libbed uses of collo- 
quial language: “Can you beat that?” 
(TU-23); “You know, they pop up all 
the time” (TU-34); “I’ve learned by now 
to take it on the chin” (MSG-79). 

Roosevelt heightened the emotional 
impact of his ideas as he addressed his 
audience by substituting and adding 
elements which evoked popular feeling, 
augmented the sense of immediate in- 
volvement and increased further the 
close rapport between speaker and lis- 
teners. 


Composition 


Certain types of changes made during 
the presentation of the speeches may 
profitably be considered as merely com- 
positional, although they also serve the 


general aim of vigorous communica- 


tion.? For example, the Reading Copies 
were consistently prepared without the 
informal pronoun-verb contraction; 
FDR almost as consistently altered these 
forms as he spoke: I am becomes ]’m; it 
is, it’s; we had, we’d; who have, who’ve. 
The informalizing effect of such con- 
tractions is well recognized; much less 
attention has been given to the fact that 
this practice allowed him to move 
quickly into the succeeding phrases and 
to be in step with their movement and 
nature; for example, the replacement of 
“We have all seen” accelerated the be- 
ginning of this sentence: “We've seen 
many marvelous stunts in the circus, but 
no performing elephant could turn a 
handspring without falling flat on the 
back” (TU-11). 

Another alteration clearly related to 
ease of sentence-movement is the vocal 
substitution of that for which; this al- 
teration occurred twenty-one times in 


7In a general sense all ad-libs could be 
classified as compositional, since all are altera 
tions of phraseology to enable the speaker to 
communicate ideas and attitudes more fully. 
Nevertheless, certain changes are unique to 
their context, arising out of the ideas to be 
communicated, and serving purposes of clari- 
fying relationships, giving concreteness, pro- 
viding force or adding motivation; others, how- 
ever, though also serving these same purposes, 
take again and again the same forms and estab- 
lish themselves as a frequently-found type of 
change. Therefore, the content-centered group 
has been already considered above under 
appropriate headings; the form-centered group 
will now be taken up as compositional changes. 
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the five speeches under consideration.® 
Repeated listening of FDR’s transcribed 
addresses gives one the impression that 
the phonetic components of the word 
that allow for much greater adjustment 
than those of the word which; that can 
be tucked in vocally under other words 
and thus allow the stress to fall on them, 
but which is not capable of such swift, 
easily-modified production.® Clearly the 
superior phonetic flexibility of the one 
word is the reason for its substitution in 
sentences like this: “Compare the score- 
board that (which) you have in Pitts- 
burgh now with the scoreboard that 
(which) you had when I stood here at 
second base in this field four years ago” 
(Pitts-36). 

Another instance of FDR’s substitu- 
tion for the sake of vocal emphasis oc- 
curs in his interchange of the words on 
and upon. It appears that when he 
wished to place particular stress upon 
the substantive words of the phrase he 
minimized the relational words, substi- 
tuting on for upon.® In five cases in 
these addresses FDR _ substituted on 
orally for the upon shown in the Read- 
ing Copies; for example, “the crushing 


8 Although FDR sometimes wrote word 
changes into the Reading Copy with his pen, 
at no point in these five addresses had he 
so struck out the which and written in a that; 
these were spontaneous oral changes. 

9Indeed, with [,] relatively incapable of 
reduction to a more relaxed vowel in compar- 
ison with [ze], and the complexity of the clos- 
ing consonant|t {J in comparison with the 
simplicity of [t], one can see that FDR would 


be able to serve his exquisite sense of timing 
more adequately with that than with which 
An auditory analysis of Roosevelt’s Address to 
the Congress on January 3, 1936, noted his 
“shortening of unemphasized material.” Charles 
H. Voelker, “A Phonetic Study of Roosevelt,” 
The Quarterly Journal of Speech, XXII (Oc- 
tober 1936), 367. 

10 H. W. Fowler has explained that if euphony 
would cause upon to have two unaccented 
syllables instead of being produced with a clear 
6, then on is usually preferred. A Dictionary 
of Modern English Usage (Oxford University 
Press, 1950), p. 683. 


load the debt will impose on (upon) 


your children and mine” (Pitts-37); 
“The happiness of future generations of 
Americans may well depend on (upon) 
how effective and how immediate we 
can make our aid felt” (FF-56); “a plan 
based on (upon) the wishes of the sol- 
diers themselves” (TU-35). But he also 
varied the words in the opposite man- 
ner. In one case Roosevelt had written 
into the Reading Copy by his own hand 
a sentence closing with the words “an 
(MSG-18); nevertheless, 
in speaking, he substituted upon for the 
written on, and used full stress on the 


attack on = us” 


word’s final syllable. Clearly he made 
spontaneous alterations to allow him- 
self the vocal emphasis he desired. 


A frequent change related to sense 
rather than to sound involves the words 
this and that (also these and those). He 
shifted from this to that and from these 
to those to set aside a completed thought 
and move into a succeeding idea. For 
example, he refocused attention by such 
substitution: “Contrast those (these) ex- 
penditures and what we got for them 
with certain other expenditures” (Pitts- 
28). At other times a deprecatory force 
seemed to underlie the substitution as 
if removing the undesirable to a greater 
distance: “six billion dollars of increased 
assets to set off against those (these) li- 
abilities” (Pitts-33). He changed also in 
the opposite direction, as, for example, 
when he drew an already mentioned 
element more clearly into focus: ““To- 
night I want to begin with you a dis- 
cussion of this (that) crisis” (Vic-2). 
Thus, in his reading, FDR shifted these 
words to produce the focus of attention 
he desired or to indicate his approval or 
rejection of an idea. 

Concern over clarity of ideas is again 
shown in the frequent oral repetition 
of a word or phrase before a modifying 
element. FDR read the word or phrase 





be 


acl 


in! 


in; 
pr 


hi: 


sp 
Sal 
th 
In 
stl 
fu 
m 
to 
m. 
tic 
th 
dt 

to 
sil 
el 
m 

co 


di 








in its place in the basic structure of the 
sentence, and repeated it spontaneously 
to form an anchor for the ensuing mod- 
ification. Examples are numerous: “For 
what we send abroad, we shall be re- 
paid, repaid within a reasonable time 
following the close of hostilities, repaid 
in similar materials” (FF-51); “And now 
we face another crisis—a crisis of a dif- 
ferent kind but fundamentally even 
more grave” (Vic-2). And, further, when 
FDR introduced an impromptu modifi- 
cation of any significant length he often 
brought the framework of the sentence 
back into focus by repetition. Thus, 
FDR seemed intent upon holding the 
basic structure of the sentences clearly 
ad-libbed to 


achieve this goal more fully. 


before his audience and 


Therefore, by compositional changes 
introduced at the moment of speaking, 
Roosevelt secured a text aimed at allow- 
ing the desired vocal emphasis and at 
providing clarity of ideas. 

Roosevelt continued the tailoring of 
his text as he spoke; what had been so 
carefully prepared was altered in the 
speaking situation by changes of the 
same five types as had been made during 
the preparation of the Reading Copy. 
In the act of utterance Roosevelt was 
striving to 


clarify relationships _ still 


further, to render 


his coverage even 
more inclusive. He made few additions 
to add concreteness, but a very great 
many to add force to his communica- 
tion and also a great many to heighten 
the responses of his listeners. He intro- 
duced compositional changes that served 
to allow him the vocal emphasis he de- 
sired and to stress the basic thought 
elements which he deemed necessary for 
making his meanings immediately and 


compellingly vivid to his listening au- 
dience. 
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ORAL COMMUNICATION Ap-LIBS 


But Roosevelt also made spontaneous 
changes directly associated with the act 
of oral communication. To remedy in- 
advertent, infrequent errors of reading 
and speaking, to prevent audience mis- 
hearing, to deal with audience responses 
necessitated unfore- 


—these purposes 


seen alterations of a dozen kinds. 


Corrections for bobbles 

In some two and one-half hours of 
oral presentation (the five speeches com- 
bined) it might be expected that the 
speaker would experience some difficulty 
in mere accuracy of utterance, would 
get his “tongue twisted” a number of 
times. There are, however, but four 
such instances in the whole compass of 
these addresses. None occurred in the 
Free- 
doms speech, but either one or two ap- 


Pittsburgh address or the Four 


peared in each of the other speeches: 
“wide-sped widespread refusal” (Vic- 
56); “crystal crear clear” (MSG-26); “by 
the statesmen statements of men” (TU- 
18); and “such as that old warm worm- 
eaten (TU-38). In all of 
these cases the correction was uttered 
with little if any break in rhythm and 
none in poise." 


chestnut” 


There are also a few examples of 
seemingly unintended repetition. He 
seemed to bounce on words at times: 
“I I asked the Congress” (MSG-33), and 


11 From his auditory study of the one address 
Voelker declared: “His feeling for rhythm in 
speech is quite exceptional. His sense of 
rhythm is probably related to his sense of 
style. The rhythm is not lost when he makes 
mistakes. His corrections are exactly on the 
beat. That is a good criterion for judging how 
ingrained is his sense of rhythm. Furthermore, 
if the cadence is determined by the pace of a 
phrase before a pause, the first emphasized 
word after the pause is exactly on the beat. 
If the pause is lengthened by applause, this 
still holds true. Of course, the nature of the 
applause in the interval will vary the cadence. 
His perfection in this is such that it would 
seem that he had been keeping time during 
applause.” Loc. cit. 
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“people themselves realize the realize 
the increasing urgency” (Vic-51). Else- 
where he seemed to have come sooner 
than he expected upon a word or phrase 
and the repetition was not to be pre- 
vented, as in “have been with us with 
us” (TU-17) and “to increase our na- 
tional defense defense’”’” (MSG-34). In 
one instance he repeated an unfinished 
word: “for a def- definite reduction of 
farm tenancy” (Vic-36). Though not 
bobbles, these are irregularities of vocal 
production and changes from the Read- 
ing Copy. 


Correction of misreadings 


One might also expect a considerable 
number of changes to correct misread- 
ings; it is true, however, that FDR read 
with remarkable accuracy. There are a 
few cases of transposition of words: 


from 
who have already laid 

from 
I believe they are certainly among the ablest 
men 


Infrequently he omitted a significant 


it up, as in these instances: 
from 
And from where I stand it looks 


from 
These very men are now asking 


and “candidates who burst out in elec- 
tion-year election-year affection for so- 
cial legislation” (TU-24). In a few in- 
stances Roosevelt altered the text by 
anticipating later phraseology. For ex- 
ample, he replaced “we” with the words 
“you and I and everybody else” perhaps 
drawn from a sentence two paragraphs 
later (Pitts-5); later he substituted for 
“no one” the specification “nobody with 
any sense of responsibility,” likely an- 
ticipating the idea of governmental re- 
sponsibility explained 
later (Pitts-17). 
Elsewhere he replaced immediately a 
misread word with the correct one, as in 
these instances: “the way to ge- win 
votes (MSG-37), “we oppose forc- en- 
forced isolation” (FF-4), “factory owners 
and shop owners form the background 
the backbone” (Pitts-13), and “an or- 


to 
who have laid already (TU-15); 

to 
I certainly believe that they are among the 
ablest men (MSG-12).12 


four sentences 


word and repeated the clause to pick 


to 
And where I stand and from where I stand it 
looks (Pitts-1); 

to 
These men these very men are now asking 
(TU-14). 


On another occasion he enforced his correction with specific words: 


from 
When the first World War ended, we were 


There are several examples of misread- 
ing a word as a noun, then repeating it 
to make it the adjectival modifier it 
was meant to be: “And they too will 
resent this campaign this campaign-talk” 
(TU-17); “efficiency in defense in de- 
fense preparations of any kind” (FF-57); 


to 
Remember that when the World War, I mean 
the first World War, broke out, we were (MSG- 


15). 


derly peacetime mobilization demobili- 
zation” (TU-45). Two examples of mis- 
reading from the Teamsters’ Union ad- 
dress are of more serious nature. In one 

12 These changes are considered as misread- 
ings rather than purposeful alterations because 


repeated listening to the speech recordings re- 
veals the passages as rather weakly read. 
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instance he hesitated, ad-libbed and re- 
peated as though he had momentarily 
lost his place in the manuscript: “There 
was some—in the Congress and out— 
who raised their voices against our prep- 
arations for defense—before and after 
1939, objected to them, raised their voices 
against them, as hysterical war mon- 
gering” (TU-13). In another instance he 
seemed to waver on a phrase as though 
he had lost the train of thought and 
then repeated to continue: “Our mil- 
lions of soldiers and sailors and mer- 
chant seamen have been handicapped or 
prevented from voting by those _pol- 
iticians prevented from voting by those 
politicians and those candidates who 
...” (TU-26). These repetitions do not 
harm the meaning, and only their in- 
tonation reveals them as misreadings 
rather than as attempts to add force. 

There are a few instances where the 
misreadings apparently escaped his no- 
tice: 1949 for 1941 (TU-13), 41 for 43 
(Pitts-6), policy for problems (Vic-58), 
course for force (MSG-g95), Congress for 
policy (MSG-31). Elsewhere he seems to 
have noticed his irregularity and showed 
by his pause that he thought it unnec- 
essary to change, as when he read “per- 
fectly” for “certainly” in this sentence, 
“The record is certainly clear” (MSG- 
43). On the other hand, he lost the 
meaning of a sentence when he misread 
“no” for “the” and seemed to weaken 
vocally as he continued without attempt 
to readjust: “There is no (the) task of 
finishing victoriously this most terrible 
of all wars” (TU-40). On another occa- 
sion he seems to have skipped a line; the 
Reading Copy had these words: 

Without these spiritual forces we cannot 


make or maintain peace, and all three of them 
work with us toward that great end. 


But FDR said this: 


Without, without these three, all three of 
them, without their working with us toward 
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that great end,.things would not be as clear or 
as easy (MSG-96). 

It is interesting to note how often a 
misreading or a bobble is followed 
rather soon by another irregularity, 
either a mistake or an ad-lib. The Pitts- 
burgh address has a misreading, then 
an ad-libbed caution against misunder- 
standing; the Victory Dinner speech has 
a bobble, then a misreading nine sen- 
tences after the bobble. In the Team- 
sters’ Union address a bobble is fol- 
lowed by a completely ad-libbed sen- 
tence; there is also a bobble in the Fala 
paragraph (intensely dramatic and suc- 
cessful as that was), a misreading in the 
eighth sentence thereafter, then a spell- 
ing out in letters the word botched in 
the second sentence after that. Momen- 
tary faltering and a second error are re- 
actions which one might expect in a 
speaker, but following an error with an 
ad-libbed element is behavior revealing 
remarkable confidence and skill. Even 
more noteworthy, however, is the ex- 
tremely small number of errors of any 
type that occurred in these five im- 
portant addresses. 


Recovering from audience blurring of 

words 

When audience response broke into a 
sentence, FDR often repeated enough 
of the sentence to provide a basis for 
understanding the remainder: “In fact, 
it looked so much like a shut-out for 
the team that you voted a change in 
management in order to give [applause | 
in order to give the country a chance 
to win the game” (Pitts-3). Further, he 
handily supplied words to make explicit 
a relationship that would have been 
clear had not the audience response 
blurred his words: “Not holding com- 
pany executives or lawyers, [growing 
applause] I’m talking about men ex- 
perienced in actual production” (MSG- 
11). Not many examples of such blurring 
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occurred, however, for Roosevelt was 
for the most part in charge of the ap- 
plause of his audiences, calling it forth 
almost at will by the play of his voice, 
inviting vigorous response strongly and 
surely by subtle, sudden combinations 
of pitch, force and rate. 


Taking pains to avoid mishearing 


Whe Roosevelt spelled out the word 
botched (‘“They were botched—b-o-t-c-h- 
e-d they were botched by a Republican 
Administration” TU-43), he seems to 
have been ad-libbing more for emphatic 
effect and audience enjoyment than to 
avoid confusion in understanding. In 
another sentence, however, a direct ref- 
erence to spelling does seem to have been 


aiming at clarity: “you and I—all of us 





together—were making forty-one billion 
—spelt with a ‘b’ not an ‘m’—less dol- 
lars in 1932” (Pitts-6). There is little 
doubt even here, of course, but that 
Roosevelt was fully aware of the added 
satiric effect he was achieving. 


Handling audience responses 


FDR responded directly to the audi- 
ence’s eager reactions, making effective 
use of his need for restraining them: for 
example, he held back applause that 
was beginning, “But wait, wait, it’s been 
.” (Pitts-35), and he 
halted the booing so that he could elicit 
an even greater response with his punch 
line, “now wait, a perfectly beautiful al- 
literation—Congressmen Martin, Barton 
and Fish’ (MSG-53). 

He also ad-libbed to terminate audi- 
ence responses. To conclude the vigor- 
ous applause which he had elicited by a 
pugnacious thrust he ad-libbed “and 


fifty-five years 


so” with a rising inflection and a com- 
manding tone as he began the follow- 
ing sentence: “And so on top of having 
to meet the ordinary expenses of the 
government .”’ (Pitts-10). Shortly 





thereafter he quieted the applause with 
“Yes, and I 
needs 


refused to leave human 
... (Pitts-14). Sometimes he de- 
manded their attention by repetition 
of an element present in the Reading 
Copy (as he bypassed boos for Hoover's 
administration by repeating increased 
the debt, Pitts-24) or of an ad-libbed 
one (as he acknowledged the cheers of 
his “old friends’ but moved into the 
next statement, “You know—you know, 
this is not the first time’ TU-1). Thus 
he often provided expendable words 
with which to quiet his audience so that, 
having utilized the effect of their in- 
terruption, he might then proceed. 


These corrections for bobbles and 
additions to deal 
with audience blurring of his words, to 
hold back or terminate 


sponses and to 


misreadings, these 
audience _re- 
prevent listener mis- 
understanding—these ad-libbed elements 
are evidences of the interaction between 
vigorous audiences and a speaker of 
great sensitivity and skill. 

It is clear then that two types of ad- 
libbing were occurring in these speeches: 
those alterations which continued the 
general processes of careful speech prep- 
aration, and those which arose from the 
speaker’s immediate interaction with 
his listeners. It is interesting to note that 
Roosevelt was herein obeying certain 
well-known precepts of public speak- 
ing: (1) that the actual labor of crea- 
tion should be going on vigorously dur- 
ing the act of speaking, the speaker 
seeking to express by word and voice the 
fullness of an idea, its relationships, its 
purpose; (2) that the speaker should, 
during the act of speaking, so impassion 
himself with his meaning that he seeks 
spiritedly for words that allow him 
suitable vocal and connotative play for 
his swelling thoughts; (g) that the 
speaker should, as he speaks, so involve 
himself with his listeners’ responses that 
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he seeks actively for words that will 
keep before them vividly the ideas they 
must not miss, that will focus their at- 
tention and heighten their feelings. 

Did these ad-libs alter greatly the mes- 
sage Of the addresses? No, probably 
not;'® but they did alter greatly the im- 
pact of those messages, for they allowed 
Roosevelt to relate his ideas and him- 
self to his listeners in more vibrant, 
compelling terms. 


SPECULATION 

Mindful of the careful labor of text- 
preparation in which FDR and his as- 
sistants regularly engaged, one feels im- 
pelled to ask: why did Roosevelt ad-lib 
so much? It is true that he as any other 
speaker had to recover from bobbles 
and misreadings; but when this has 
been considered it accounts for but a 
percentage total 
changes that FDR made in speaking 
from 


small indeed of the 


these carefully texts. 


Most of the changes were voluntary and 


prepared 


spontaneous.'* Why did he make them? 
The best answer seems to be: because he 
enjoyed it so much. 

One might reasonably ask where FDR 
might have gained the ability to ad-lib 
so easily and confidently as to afford 
him such a measure of pure joy and 
amusement. Certainly his long hours 
daily on the telephone’® and in face-to- 
face conversation with men of widely 
differing outlook gave him vast experi- 
ence. His frequent press conferences— 


18 Rosenman pointed out that Roosevelt got 
himself into remarkably little trouble through 
them. Op. cit., p. 486. 

144A somewhat different type of spontaneity 
sometimes occurred, however. Both Rosenman 
and Sherwood have pointed out that if FDR 
strongly wished to use a term or idea which 
his assistants vigorously opposed he might give 
in to its elimination from the text but hold it 
in mind and put it in at the time of delivery 
Rosenman, 483-484; Sherwood, 218. 

15 Gunther suggested that about one-fourth 
of his work day was spent in telephoning. John 
Gunther, Roosevelt in Retrospect (New York: 
Harper & Bros., 1950), p. 125. 
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nearly one thousand in all with 1 to 2 


hundred questioners _ present'*—gave 
him readiness with words and phrases 
and much opportunity to observe re- 
actions to terms and tones at relatively 
close range. But these are matters of 
skill, and the true sources of FDR’s en- 
joyment in ad-libbing seem to lie more 
deeply—in factors of personality, in 

These three 
seem significant: his sensitivity to the 
forces about him; his love of experimen- 


characteristic attitudes. 


tation, improvisation, change, mystery; 
and his pleasure in oral creativity. 


Sensitivity 

FDR understood and liked people, 
individually and in groups.’7 He was 
able to their involvement and 
their degree of acceptance of his ideas 
and of himself; he was further able to 
gauge accurately what would heighten 
their feeling and give them further en- 
joyment. Constant 


assess 


practice over the 
years had sharpened this perceptiveness 
and his confidence in his power. 


Love of experimentation 

Roosevelt’s “quicksilver mind,’’* his 
“nimbleness’’?® tended to keep a play of 
alternatives before He loved to 
represent himself as a magician who 
could amaze and amuse his audience by 


“pulling another rabbit out of a hat.’’?° 


him. 


16“The President knew most of those who 
attended regularly on first-name or nickname 
terms; he was like a friendly, informal school- 
master conducting a free-for-all seminar, and 
indeed the chief function of the conferences 
was educational. The President would explain 
an issue, take it apart, and present it over and 
over again from a new angle; he tackled the 
exposition of a difficult piece of legislation al- 
most like an engineer demonstrating a new 
model of some machine.” Gunther, pp. 135-136. 

17 He was “easy in his way with crowds” 
(Burns, p. 281); he was “insatiably curious about 
people and their ideas, sensitively attuned to 
the play of forces around him” (J/bid., p. 403). 

18 [bid., p. 475. 

19 Gunther, p. 114. 

20 Sherwood, p. 73. Gunther pointed out that 
he “loved to ‘pull’ surprises and put on a 
show.” Op. cit., p. 53- 
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This love of movement from the ex- 
pected, this enjoyment of change, this 
delight in improvisation—these traits 
were deep in FDR. Burns has made the 
excellent suggestions that it was_ his 
“powerful sense of belonging” which 
gave him a sense of footing from which 
he was free to try the new, the never- 
before-dared frontier. 


Enjoyment of oral creativity 

Winans pointed out, as we all re- 
member, that the effective speaker must 
have full realization of the content of 
his words as he utters them and a lively 
sense of communication. Roosevelt not 
only fulfilled these obligations but 
deeply enjoyed performing them. He 
seemed to take great pleasure in making 
everything abundantly clear;?* he en- 
joyed the apt word, the memorable 
phrase; he knew instinctively the timing?* 
and the tone that would give them most 
expressiveness. Indeed he thought of 


21 Op. cit., p. 78. 

22 Burns explained that “His mind yearned 
for the detail, the particular, the specific.” Op. 
cit., p. 334- 

23 Gunther, pp. 236-237. 


himself as an excellent actor, and truly 
he was.?# His audience heard a man re- 
acting vigorously to his ideas and invit- 
ing them powerfully to comprehend and 
be appropriately moved also, and no 
hearer could mistake his enjoyment of 
the whole process. 


It is not hard to understand that such 
a man—sensitive to the feelings of the 
crowd and at home before them, ex- 
perimental in his approach to ideas and 
thus only partially bound by the text 
before him, orally creative and delight- 
ing in the exercise of his talents—would 
have ad-libbed frequently and have en- 
joyed it hugely. Whether these altera- 
tions came as mere continuation of the 
usual efforts of his speech preparation 
or as a function of the interaction be- 
tween himself and his audience, they 
were spontaneous adjustments to meet 
the realities of a situation by a speaker 
who not only perceived these realities 
with great clarity but who made con- 
sequent alteration with great ease and 
great enjoyment. 


24 Rosenman, p. 338; Gunther, p. 62; Burns, 


P- 474; Sherwood, p. 218. 
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AN EXPERIMENTAL INVESTIGATION OF 
RHETORICAL CLARITY* 


ROGER E. NEBERGALL 
University of Oklahoma 


HIS paper is concerned with the 
a ee of experimental method 
to the study of one aspect of rhetorical 
style, variously called clarity, lucidity, 
or perspicuity. Writers on rhetorical 
theory from the time of Aristotle have 
usually agreed that the study of rhetor- 
ical style is an important part of the 
study of rhetoric, and that clarity is a 
fundamental quality’ or aspect of effec- 
tive style. Little work has been done in 
the area of style by those interested in 
experimental research in communica- 
tion. So, the general question, “Is it 
feasible to study rhetorical style by 
experimental method?”’, is more appro- 
priate than the more specific question, 
“What are the 
style?” 


elements of effective 
An affirmative answer to the first 
question, of course, would suggest that 
later research could profitably be con- 
cerned with the second. 

Specific questions posed by this study 
in an effort to find out whether style, 
or more particularly clarity, can be in- 
vestigated by the procedures used here 
are: 


1. Do people differ in their ability to con- 
struct “clear” messages? 

2. Is a message “clearer” in written form than 
it is in oral form? 

3. Is the ability to construct a “clear” message 
related to the general verbal ability of an 
individual? 

4. Will messages judged “clear” by the meas- 
ures here proposed also be judged “clear” 
by rhetorical critics? 

5. Will messages judged “clear” by the meas- 
ures here proposed be 


judged “compre- 


hensible” by readability techniques? 


*Based on a Ph.D. dissertation, University of 
Illinois, 1956, directed by Marie K. Hochmuth. 


In order to answer these questions, a 
definition of clarity is needed which will 
make measurement of this quality pos- 
sible. The following definition of clarity, 
which is related to the definitions pro- 
posed by many rhetorical theorists, is 
proposed: 

A message is clear to the degree that a speaker’s 
intended meaning for a message agrees with the 
audience’s obtained meaning from the message. 


A completely clear message, accord- 
ing to this definition, would be one 
from which the audience obtained exact- 
ly the same meaning as the speaker in- 
tended for the message, and the clarity 
of the message would be less than com- 
plete to the extent that the meaning 
which an audience obtained from a mes- 
sage differed from the meaning that a 
speaker intended for it. 
mental application of 


The experi- 
this definition 
presumes, of course, an objective meas- 
ure of meaning. The measure of mean- 
ing employed in this study is Osgood’s 
semantic differential.? 


Since the semantic differential is re- 
garded by those who developed it as a 
logic, rather than as a test in any final 
form,” it was necessary to develop a form 
of the semantic differential to employ 


in the experiment proposed in_ this 


1 The most complete discussion of the 
semantic differential is in Charles E. Osgood, 
George J. Suci, and Percy H. Tannenbaum, 
The Measurement of Meaning (Urbana: The 
University of Illinois Press, 1957). 

2Charles FE. Osgood, “Report on_ the 
Development and Application of the Semantic 
Differential” (Unpublished Monograph, Uni- 
versity of Illinois, Institute of Communications 
Research), p. 14. 
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TABLE I* ra 
THE TWENTY-Four SCALES EMPLOYED IN THE EXPERIMENT AND THE FACTOR LOADING gts 
ON EACH OF THE FACTORS. ver 
Factor I Factor II Factor III we 
(Evaluation) (Potency) (A-tivity) ins 
1. good-bad 88 OF —.09 we 
2. large-small .06 62 34 the 
3. beautiful-ugly 86 .09 O1 
4. hard-soft —.48 55 16 Ab 
5. Sweet-sour 83 —.14 —.0g pu 
6. strong-weak 19 62 .20 
7. calm-agitated 61 .00 —.36 suc 
8. red-green —.33 —.08 35 sag 
g. loud-soft —.39 44 23 
10. deep-shallow 27 .46 14 of 
11. sharp-dull 25 07 52 stil 
12. ferocious-peaceful —.69 17 41 lie 
13. heavy-light —.36 62 —.11 _ 
14. hot-cold —.04 —.06 46 
15. nice-awful 87 —.08 -19 
16. treble-bass 33 —.47 06 of 
17. angular-rounded —.17 08 43 bv 
18. fragrant-foul 84 —.04 —.11 
19. honest-dishonest 85 .07 —.02 ple 
20. active-passive 14 O4 59 on 
21. fair-unfair 83 08 —.07 
22. rugged-delicate —-.42 60 .26 de: 
23. fast-slow O1 .00 .70 sta 
24. clean-dirty 82 —.05 03 
wa 
*From Charles E. Osgood, “Report on the Development and Application of the Semantic ask 
Differential” (Unpublished Monograph, University of Illinois. Institute of Communications 
Research), p. 9. on 
lin 
— : ka on , sen 
study. This was done by selecting the Osgood and Suci.* The D formula is: é, 
. ° . ° rs 
eight sets of polar adjectives showing D = \/sd? rit 


the highest factor loadings on each of ° 
7 8 ' In this formula, d_ represents the § sec 
distance between the two profiles on orc 


each of the scales. On a form of the 


the three factors which have been so 
far identified. The twenty-four sets of 





polar adjectives employed, together semantic differential employing 24 scales, ee 
with their factor loadings are shown in’ fp has a minimum value of o and a 
Table I. ; ie " pu 
maximum value of \/864 or 29.39, since vor 
A form of the semantic differential, the largest possible d on each scale is 6 . 
when filled out by a subject who checks making d? = 36, and on 
one of the seven possible positions on the 24 . 36 — 864. spe 
seven-point scale separating each of the ho 
sets of polar adjectives employed, gen- PROCEDURE gre 
erates a profile of the subject’s meaning Reproductions of five abstract paint- he: 
for the particular concept tested. In the ings on slides suitable for projection sag 
present experiment, the concern is with were secured, and five students were wh 
the relationship of two such profiles, the selected to serve as communicators. ‘Two for 
profile generated by the speaker, and of the communicators selected were : 
the profile generated by a member of 8 Charles E. Osgood and George J. Suci, “A dr 
an audience. These profiles can be com- Measure of Relation Determined by Both Mean the 
Difference and Profile Information,” Psychologi- 
pared by the D method developed by cay Bulletin, XXXXIX (May, 1952), 251-62. au 
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graduate students in speech at the Uni- 
versity of Oklahoma; the other three 
were undergraduate students at the same 
institution. In the experiment, they 
were asked to describe the meaning 
they perceived in the abstract pictures. 
Abstract paintings were selected for this 
purpose since it was felt that the use of 
such stimuli would insure that the mes- 
sages would provide as much as possible 
of the available information about the 
stimulus for the members of the au- 


diences employed. 


Each communicator was shown each 
of the pictures (projected on a screen 
by a slide projector) and asked to ex- 
plain his meaning for it, “so that any- 
one hearing your explanation will un- 
derstand the picture just as you under- 
stand it.” Two or three days later, he 
was shown the pictures again and was 
asked to record his meaning for them 
on the semantic differential. With the 
limitation that each picture was _pre- 
sented to one of the communicators 
first, to another second, etc., the pictures 
were presented in random order. The 
second presentation of the pictures was 
ordered the same as the first. 


The messages obtained in this fashion 
were literally transcribed. Appropriate 
punctuation was inserted and random 
vocalizations omitted. 


Ten audiences of 20-25 persons each, 
consisting of sections of the beginning 
speech course at the University of Okla- 
homa, were arbitrarily divided into two 
groups of five audiences each. One group 
heard the recorded versions of the mes- 
sages, and the other read the messages 
when they were presented in written 
form. 


Two Latin squares were randomly 
drawn, governing the order in which 
the messages would be presented to each 
audience. The procedure followed with 


each audience for the recorded versions 
of the messages was as follows: 

A tape was prepared with the five 
messages that the particular audience 
was to hear. The of that 
class introduced the experimenter, and 
the experimenter instructed the group 


instructor 


in the procedure of the experiment and 
the use of the scales of the semantic 
differential. The first message was played 
on a tape recorder and the audience 
filled out the semantic differential for 
that message. Then the next message wa 
played and the procedure repeated. The 
total time required to administer the 
test for the five messages was about 25 
minutes. 

A similar procedure was used for ad- 
ministering the test with the written 
form of the message except that the 
printed copy of the message was distrib- 
uted and was available while the suh- 
jects were differentiating their mean- 
ing for it. Thus, for the oral form of 
the message, the subjects had to recall 
the message as they were filling out the 
scales of the semantic differential; for 
the written form, they could refer to the 
message while filling out the scales. It 
was felt that this provided a realistic 
comparison of the oral and written 
forms, the fundamental 
oral and written 
communication is that a reader has an 
opportunity to re-read the communica- 
tion while the listener has an oppor- 


since one of 
differences between 


tunity to hear the communication only 
once. 


Various other measures were then 
gathered. Scores for all audience mem- 
bers on the Ohio State Psychological 
Examination obtained. Also, 
twelve rhetoricians were asked to judge 
the messages as to their “clarity.” This 
was done by asking them to record their 
judgment as to the clarity of the message 


by checking the appropriate space on a 


were 
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sé 


seven point scale ranging from “ex- 
tremely clear,” to “extremely unclear.” 

Finally the messages were evaluated as 
to their “comprehensibility” by subject- 
ing them to “‘cloze” analysis. The “‘cloze 
procedure” is a measure of the compre- 
hensibility of messages, developed by 
Wilson Taylor which correlates highly 
with the standard readability measures 
while appearing to avoid some of the 
most obvious defects in them.* For the 
“cloze” analysis, student groups comp- 
arable to the original audiences were 
employed. 


ANALYSIS OF DATA 
The following hypotheses were tested 
in the analysis of the data obtained in 
the experiment: 

1. That there is a difference among the com- 
municators employed in their ability to 
communicate their intended meaning. 

2. That there is a difference in the general 
level of transmission of meaning between the 
oral and written forms of the messages. 

3. That the differences among the messages in 
their effectiveness in conveying a speaker's 
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intended meaning to an audience are related 
to the differences in clarity of the same mes- 
sages as judged by rhetorical critics. 
4- That the ability of a member of an audi- 
ene to obtain a speaker’s intended meaning 
is related to his verbal ability as measured 
by the Ohio State Psychological Examination. 
That the differences among the messages in 


i) | 


their effectiveness in conveying a speaker's 
intended meaning to an audience is related 
to the differences in comprehensibility of 
the same messages as determined by “cloze 
procedure.” 


Differences in Ability to Communicate 
Intended Meaning 

This hypothesis was tested by analysis 
of the 
both the oral and written forms of the 


of variance data obtained for 
mesages. The cell entries in Table II 
are the means of the D scores obtained 
between the speaker’s meaning profile 


for the message, and the profile for the 


message of each member of the au- 
dience, for the audiences which heard 
the oral form of the messages. Table 


III shows the results of the calculation 


TABLE Il 
MEAN AUDIENCE D. 

















Oral Messages 


~ Audience — 

Speaker - « a | IV V Sum 

1 11.31 11.12 12.86 11.09 13.35 59-73 
(Mc) (Md) (Mb) (Ma) (Me) 

2 11.12 11.66 15.85 13.93 10.96 63.52 
(Md) (Mc) (Ma) (Me) (Mb) 

3 13.59 11.95 14.10 17.72 14.56 71.92 
(Mb) (Me) (Mc) (Md) (Ma) 

4 12.78 10.23 13.11 11.24 14.64 62.00 
(Ma) (Mb) (Me) (Mc) (Md) 

5 10.04 8.31 10.27 10.00 9.96 48.58 
(Me) (Ma) (Md) (Mb) (Mc) 

Sum 58.84 53-27 66.19 63.98 63-47 /305-75 








Note: Mx identifies the message generated by 


municator in the same row. 


4Wilson Taylor, Cloze Procedure”: A 
New Tool for Measuring Readability,” Journal- 
ism Quarterly, XXX (Fall, 1953), 415-433; also 
“Recent Developments in the Use of ‘Cloze 
Procedure,” Journalism Quarterly, XXXIII 
(Winter, 1956), 42-48. 


the picture (x) when presented to the com- 


of analysis of variance for the data in 
Table II. Table IV shows the data for 
the received the 
written form of the messages. Table V 


five audiences which 
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TABLE III 

ANALYSIS OF VARIANCE FOR ORAL MESSAGES. 
Sources of Mean 
Variation df Square F 
Columns 

(Audiences) 4 5-31 2.46 
Rows (Speakers) 4 14.12 6.54° 
Treatments 

(Pictures) 4 1.90 — 
Residual 12 2.16 











*Significant at the 1% level of confidence. 


their intended meaning to the audiences 
employed. 


The Oral and Written Forms 

The significant difference between 
audiences evidenced in the analysis of 
variance applied to the written form of 
the message raises some question about 


the nature of the parent population (all 


TABLE IV 
MEAN AUDIENCE D. 











Written Messages 














Au‘tience ma 

Speaker .. | Ill IV Vv Sum 

1 13.57 10.29 10.08 13.74 10.87 58.55 
(Me) (Mc) (Ma) (Mb) (Md) 

2 9-74 11.87 11.88 15.82 11.64 60.95 
(Md) (Me) (Mc) (Ma) (Mb) 

3 13.22 13.60 13.81 17-54 11.32 69.49 
(Mc) (Ma) (Mb) (Md) (Me) 

4 12.50 10.35 13.29 12.76 12.11 61.01 
(Ma) (Mb) (Md) (Me) (Mc) 

5 ; 8.7 9-52 9-27 8.92 9-45 45-92 
(Mb) (Md) (Me) (Mc) (Ma) 

Sum 57:79 55-63 58.33 68.78 55-39 /295-92 








shows the results of the calculation of 
analysis of variance for the data in 
Table IV. It will be noticed that in 
both forms of the experiment the null 
hypothesis that there is no difference 











TABLE V 

ANALYSIS OF VARIANCE FOR WRITTEN MESSAGES | 
Sources of Mean 
Variation df Square F 
Columns 

(Audiences) 4 6.09 3-42° 
Rows (Speakers) 4 14-45 8.12** 
Treatments 

(Pictures) 4 3.21 — 
Residual 12 1.78 








*Significant at the 5 percent level of confidence. 
**Significant at the 1 percent level of confidence. 


between communicators can be rejected 
(p<.o1), and therefore it can be con- 
cluded that there was a significant dif- 
ference between the communicators em- 
ployed in their ability to communicate 


possible sections of the beginning speech 
course at the University of Oklahoma). 
For this reason, it was decided to com- 
pare the results of the oral and written 
forms of the messages by a non-paramet- 
ric statistic which makes no assumptions 
about the nature of the parent popula- 
tion. 

Applying a sign test’ to the speaker 
totals in Tables II and IV we find that 
for all five speakers the total D score is 
greater in the oral form of the test than 
in the written. Using a two-tailed test 
of significance, this difference is sig- 
nificant at about the 2 per cent level. 


Applying the same sign test to the 
individual message scores in the two 
tables, however, we find that seventeen 


5 Helen Walker and Joseph Lev, Statistical 
Inference (New York: Henry Holt & Company, 
1953), P- 430. 
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of the D scores are higher for the oral 
form of the test, and eight are higher 
for the written form of the test. Using 
a two-tailed test of significance, this 
difference is not significant. 

These results are probably not sufh- 
ciently conclusive to enable us to re- 
ject the null hypothesis, and to conclude 
that the speakers’ meanings were bet- 
ter transmitted by the written form of 
the messages than by the oral form. At 
the same time they suggest that a more 
precise analysis might show significant 
differences between the oral and written 
forms of the messages. 


Rhetorician’s Judgments of Message 
Clarity 

In testing the third hypothesis, the 
sum of D scores for each message in 
both the oral and written forms of the 
test was obtained. This sum of D scores 
was regarded as a measure of the clarity 
of the message. Using this score, the 
messages were rank-ordered as to their 
clarity. The judgments of the rhetorical 
critics on the seven point rating scale 
were summed for messages, with a check 
to the far left of the scale being recorded 
as a 1 and a check to the far right of 
the scale being recorded as a 7. These 
total scores were then rank-ordered. The 
rank-orders assigned the messages by the 
two methods were then compared by 
rank-order correlation. The rank-order 
correlation obtained was —.33 which is 
not significantly greater than could be 
expected by chance. 


From these results we may conclude 
that we are unable to reject the null 
hypothesis, and unable to show that 
the differences among the messages in 
their effectiveness in conveying a speak- 
er’s intended meaning to an audience 
are related to the clarity of the same 
messages are judged by rhetorical critics. 


O.S.P.E. Scores 

The fourth hypothesis was tested by 
analysis of variance. The question was 
whether the D scores of individuals in 
different O.S.P.E. 
garded as random samples from the 


deciles could be re- 


same population, or whether the mean 
D for individuals in one O.S.P.E. decile 
might differ from the mean D for in- 
dividuals in another O.S.P.E. decile. 
Two O.S.P.E. scores (total and reading) 
and two sets of individual D_ scores 
(written and oral) were available. Four 
analyses of variance were computed (all 
possible combinations). The results of 
these analyses of variance are reported 
in Tables VI, VII, VIII, and IX. 
TABLE VI 
ANALYSIS OF VARIANCE OF INDIVIDUAL D Scorers 
AND O.S.P.E. ToraL, ORAL Form. 

















Source of Mean 
Variation df Square F 
Between groups 9 47.85 — 
Within groups 79 56.96 
Total 88 

TABLE VII 


ANALYSIS OF VARIANCE OF INDIVIDUAL D SCORES 
AND O.S.P.E. READING, ORAL ForRM. 











Source of Mean 

Variation df Square F 
Between groups 9 133.00 2.84° 
Within groups 79 46.74 

Total 88 








*Significant at the 1 percent level of confidence. 


TABLE VIII 
ANALYSIS OF VARIANCE OF INDIVIDUAL D SCORES 
AND O.S.P.E. TOTAL, WRITTEN ForM. 

















Source of Mean 
Variation df Square F 
Between groups 9 42.26 —_— 
Within groups 49 52.93 
Total 58 

TABLE IX 


ANALYSIS OF VARIANCE OF INDIVIDUAL D SCORES 
AND O.S.P.E. READING, WRITTEN ForM. 











Source of Mean 

Variation df Square F 
Between groups 9 20.33 ae 
Within groups 49 56.95 

Total 58 
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In these four analyses of variance, 
the results differ from 
chance in only one. In the analysis in- 
volving the D scores obtained from au- 
dience members hearing the oral form 
of the message, and the O.S.P.E. Read- 
ing scores, a difference significant at 
the 1 per cent level is noted. Therefore, 
we may conclude that for the oral form 
of the messages tested here, there is a 
significant 


significantly 


relationship between _ the 
ability of an audience member to obtain 
a speaker’s intended meaning and his 
verbal ability as judged by his reading 
score on the Ohio State Psychological 
Examination. 


Cloze Scores 


The fifth hypothesis was tested by 
rank order correlation. The cloze scores 
obtained for each message were used as 
a basis for ranking the messages as to 
their comprehensibility. The rank-order 
correlation between these ranks and the 
ranks obtained for the clarity of the 
messages in testing Hypothesis III was 
computed. The rank order correlation 
was +.26 which is not significant. 


Thus we may conclude that we can- 
not reject the null hypothesis and are 
unable to establish that there is a rela- 
tionship between the comprehensibility 
of a message as determined by the cloze 
formula, and the clarity of the same 
message as judged by the methods em- 
ployed here. 


DISCUSSION 


Interpretation of Communicator 
Differences 


The nature of the Latin square de- 
sign is such that while the presentations 
of column variables, row variables and 
treatment variables can be randomized 
in presentation, the order of presenta- 
tion of the row variable cannot be ran- 
domized, and therefore the results ob- 


tained for the row variable are con- 
founded with the order of presentation. 
For this experiment, this means that 
while each audience heard one message 
from each speaker which had been 
selected at random (with the limitation 
that each message be heard by one au- 
dience), and one message about each 
picture (with the limitation that a mes- 
sage about a certain picture would be 
presented first to one audience, second 
to another, etc.), each audience heard 
a message from the same speaker first, 
from the same speaker second, etc. 

While this is a limitation of this par- 
ticular design, there is no reason to be- 
lieve that this confounding affected the 
results obtained. Studies that have been 
done on the effects of speaker order have 
reached somewhat ambiguous _ conclu- 
sions, but their conclusions have not 
indicated that the results obtained by 
this experiment could have been pre- 
dicted on the basis of speaker order. 
These studies have indicated that speak- 
er positions other than first and last 
tend to be favored,® so the results in 
this experiment, with the iowest D 
scores obtained for the first and last 
speakers would indicate that this type 
of order preference was not operating 
in this case. 


The Evaluative Dimension 


As will be seen by reference to Table 
I, the form of the semantic differential 
employed in this experiment is rather 
heavily weighted on the evaluative di- 
mension. This is in spite of the fact 
that the twenty four scales selected con- 
sisted of the eight scales with the highest 
factor loadings on each of the three 
dimensions. This is due, of course, to 


6F. H. Knower, “A Study of Rank Order 
Methods of Evaluating Performance in Speech 
Contests,” Journal of Applied Psychology, XXIV 
(October, 1940), 633-644; also, Sam L. Becker, 
“The Ordinal Position Effect,” Quarterly Jour- 
nal of Speech, XXXIX (April, 1953), 217-219. 
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TABLE X 
D ScORES—SPEAKER PROFILE AND AUDIENCE MEDIAN PROFILE. 
Oral Form 
Audience 3 
Speaker I Il Ill IV V Sum 
1 7-21 7-21 6.56 “5.66 8.43 4 4 5.07 
(Mc) (Md) (Mb) (Ma) (Me) 
2 6.24 5-74 8.72 7.87 4-36 $2.93 
(Md) (Mc) (Ma) (Me) (Mb) 
3 8.18 6.40 8.66 10.63 5-57 39.44 
(Mb) (Me) (Mc) (Md) (Ma) 
4 7.00 5-29 5.83 6.71 7-35 32.18 
(Ma) (Mb) (Me) (Mc) (Md) 
5 5.38 1.73 4-47 5.20 4.00 20.78 
(Me) (Ma) (Md) (Mb) (Mc) 
Sum 34.01 26.37 34-24 36.07 29.71 /160.40 








the fact that the factor loadings on the 
evaluative factor are considerably higher 
than the factor loadings on the other 
two dimensions, and also to the fact 
that evaluative elements are found in 
many of the activity and potency scales. 


TABLE XI 


ANALYSIS OF VARIANCE OF 
DATA-MEDIAN PROFILES. 














Source of Mean 
Variation df Square F 
Columns 

(audiences) 4 3.10 2.46 
Rows (speakers 4 9-77 7.759 
Treatments 

(pictures) 4 1.32 — 
Residual 12 1.26 








*Significant at the 1% level of confidence. 


Reference to the text of the messages 
employed in this experiment indicates 
that in some cases small profile differ- 
ences existed between speaker and au- 
dience for messages which appeared to 
consist largely of evaluative statements. 
It could be argued that the differences 
observed between the speakers was not 
due to differences in ability to com- 
municate their intended connotative 
meaning, but to differences in the evalu- 
ative content of the messages. 


To evaluate this possibility, a nine- 
scale form of the semantic differential 
was employed. These nine scales in- 
cluded the three activity and potency 
scales with the highest loadings, and 


TABLE XII 
D ScORES—SPEAKER PROFILE AN’) AUDIENCE MEDIAN PROFILE. 








Written Form 

















Audience P 

Speaker ae i Ill IV Vv Sum 

1 7.81 6.00 5.66 6.40 6.00 31.87 
(Me) (Mc) (Ma) (Mb) (Md) 

2 6.7 7.48 6.40 9.16 5.00 34-75 
((Md) (Me) (Mc) (Ma) (Mb) 

3 8.37 6.16 9.27 10.00 5.66 39-46 
(Mc) (Ma) (Mb) (Md) (Me) 

4 6.78 4-47 6.78 5-47 6.32 29.82 
(Ma) (Mb) (Md) (Me) (Mc) 

5 4-90 4-12 3.87 4:79 1.73 19.41 
(Mb) (Md) (Me) (Mc) (Ma) 

Sum 34-57 28.23 31.98 35.82 24.71 /155-31 











p! 


ol 


ys 


SC 


en? ew fA) eZ 


au- 
l to 
nts. 
nces 
not 
om- 
itive 
ralu- 


Line- 
itial 
in- 


ency 


and 





EXPERIMENTAL INVESTIGATION OF RHETORICAL CLARITY 250 


produced a test which was less heavily 

weighted on the evaluative dimension. 

The same analysis of variance that was 

originally performed was repeated with 
TABLE XIII 


ANALYSIS OF VARIANCE OF 
DATA-MEDIAN PROFILES. 








Written Form 








Source of Mean 
Variation df Square F 
Columns 

(audiences) 4 4-19 2.87 
Rows (speakers) 4 11.45 784° 
Treatments 

(pictures) 4 53 — 
Residual 12 1.46 








*Significant at the 1% level. 


this new data. The results of this anal- 
ysis are reproduced in Tables X, XI, 
XII, XIII. 

As can be seen from this analysis, 
some minor changes were produced, but 


the significance of the communicator 
differences remains unchanged, and the 
relative position of the speakers with 
the highest and lowest D scores remains 
unchanged. Apparently the differences 
originally noticed were not due to an 
excessive loading of the original instru- 
ment on the evaluative dimension. 


To further explore the possibility 
that the differences between the speakers 
were due largely to judgments made on 
the evaluative dimension, another anal- 
ysis was made employing the five of the 
nine scales that have almost no loading 
on the evaluative dimension. These five 
scales were the strong-weak, large-small, 
hot-cold, active-passive, and 
scales. The results of these calculations 
are reported in Tables XIV, XV, XVI, 
and XVII. 


fast-slow 


TABLE XIV 
D ScOoRES—SPEAKER PROFILE AND AUDIENCE MIEDIAN PROFILE 
ACTIVITY AND POTENCY SCALES. 























Oral Form _ 
_ Audience 
Speaker | Il Ill IV V Sum 
1 5.83 6.85, 3.16 5-29 6.32 27.45 
(Me) (Md) (Mb) (Ma) (Me) 
2 5.48 3.60 6.40 4.90 3.16 23.54 
(Md) (Mc) (Ma) (Me) (Mb) 
3 7.28 4-47 6.08 8.66 5.00 31.49 
(Mb) (Me) (Mc) (Md) (Ma) 
4 4.00 3.16 5.10 4-69 3.16 20.11 
(Ma) (Mb) (Me) (Mc) (Md) 
5 2.64 1.41 4-24 4-90 3.87 17.06 
(Me) (Ma) (Md) (Mb) (Mc) 
Sum 25.23 19.49 24.98 28.44 21.51 /119.65 
TABLE XV It will be noted that the results of 


ANALYSIS OF VARIANCE OF DATA-MEDIAN PROFILES 
ACTIVITY AND POTENCY SCALES. 




















Oral Form 

Source of Mean 
Variation df Square F 
Columns 

(audiences) 4 2.29 
Rows (speakers) 4 6.69 3.62* 
Treatments 

(pictures) 4 1.55 -—— 
Residual 12 1.85 








*Significant at the 5% level of confidence. 


the original analysis change very little 
even with this violent departure from 
the original conditions of investigation. 
The differences between the communica- 
tors are still significant, although only 
at the 5 per cent level of confidence. 
Apparently, the activity and potency 
scales enable us to distinguish between 
the messages just about as well by them- 
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TABLE XVI 
D ScorES—SPEAKER PROFILE AND AUDIENCE MEDIAN PROFILE 
ACTIVITY AND POTENCY SCALES. 
Written Form 
Audience | 
Speaker ——_— Sa —_ltCtCMC Ce Sum 
1 6.56 5-20 aa 5.09 3.32 6.00 26.17 
(Me) (Mc) (Ma) (Mb) (Md) 
2 5-92 5-10 3-74 6.56 3.16 24.48 
(Md) (Me) (Mc) (Ma) (Mb) 
3 6.48 5-00 8.48 7:07 4.36 31.39 
(Mc) (Ma) (Mb) (Md) (Me) 
4 4-47 2.64 3.16 5-10 4.12 19.49 
(Ma) (Mb) (Md) (Me) (Mc) 
5 4-59 3-74 2.50 3.87 1.41 16.21 
(Mb) (Md) (Me) (Mc) (Ma) 
Sum 28.12 21.68 22.97 25.92 19.05 /117.74 








TABLE XVII 
ANALYSIS OF VARIANCE OF DATA-MEDIAN PROFILES 
ACTIVITY AND POTENCY SCALES. 








Written Form 





Source of Mean 
Variation df Square F 
Columns 

(audiences) 4 2.44 —- - 
Rows (speakers) 4 7.13 3.58° 
Treatments 

(pictures) 4 31 —_— 
Residual 12 1.99 








*Significant at the 5% level of confidence. 


selves as when the evaluative scales are 
added. It should also be noted that this 
analysis produces the highest total D 
for speaker 3 and the lowest D total for 
speaker 5 as did the two preceding anal- 
yses. 

We may conclude, therefore, that the 
distinctions between the speakers dis- 
covered in the original investigation 
are not due solely to an overloading of 
the scales on the evaluative dimension, 
since repeated analyses which remove 
more and more of the evaluative factor 
merely operate to diminish the dif- 
ferences and not to cause them to dis- 


appear. 
The Ohio State Psychological 
Examination 


In considering the relationships be- 
tween the scores obtained by the mem- 


bers of the audiences on the O.S.P.E. 
and their total D scores, two primary 
limitations must be considered. First, 
there was no reason related to the de- 
sign or hypotheses of the experiment for 
the selection of the O.S.P.E. instead of 
any other test of verbal ability. The rea- 
sons for the choice were simply availabil- 
ity and convenience. The Ohio State 
Psychological Examination is adminis- 
tered as part of the freshman testing pro- 
gram at the University of Oklahoma, 
and the scores obtained on the exam- 
ination were available for the members 
of the audiences participating in the 
experiment. For this reason, it might 
not be as useful as some testing instru- 
ment selected with the design and hy- 
potheses of the experiment in mind. 


The second limitation regarding the 
O.S.P.E. scores is that scores on the 
examination were not available for all 
of the students serving as members of 
the audiences for the experiment, since 
transfer students to the University of 
Oklahoma are not required to take the 
freshman test battery. Thus scores were 
not available for those members of the 
audiences who were transfer students. 
Out of a total of 206 students who were 
audience members in this experiment, 
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O.S.P.E. scores were unavailable for 6o. 
However, there seemed to be no reason 
to expect these omissions of O.S.P.E. 
scores to be other than random in rela- 
tion to the experiment, and therefore 
relationships were simply calculated 
utilizing the scores available. 

Of the four analyses of variance which 
were calculated, a significant relation- 
ship appeared in only one. Since there 
were nO @ priori reasons to expect sig- 
nificant relationships in certain combina- 
tions and not in others, it would not be 
sound to select the largest F and argue 
that it represents the most highly sig- 
nificant relationship, especially since 
the four analyses of variance indicated 
only one significant F. However, this F 
is significant at better than the 1 per- 
cent level, which argues fairly strongly 
that this particular analysis of the data 
indicates a real relationship and is not 
merely capitalizing on chance. However, 
because of this limitation, the significant 
F in the analysis of variance between 
the O.S.P.E. reading score and the total 
D scores on the oral form of the test 
should be regarded as indicative but not 
conclusive. 


The Judgments of the 
Rhetorical Critics 

The lack of relationship between the 
D scores obtained for the various mes- 
sages and the judgment of these mes- 
sages as to their clarity by rhetorical 
critics needs to be further explored. 
Clarity is defined in this study as the 
relationship between a_ speaker's in- 
tended meaning and an audience’s ob- 
tained meaning, and the D scores ob- 
tained are intended as a measure of 
this relationship. The only information 
the rhetorical critics had as to the in- 
tended meaning of the speaker was such 
information as could be obtained from 
inspection of the messages. From an in- 
spection of the messages, the critics had 


to determine the speaker’s intended 
meaning, and decide how effectively 
this particular would com- 
municate that meaning to an audience. 
If the measure proposed in this study 


is valid, we were unable to demonstrate 


message 


that the rhetorical critics were able to 
do this. Actually, this failure is not too 
surprising. It seems doubtful that there 
is any necessary relationship between 
the form and content of a message and 
the speaker’s intended meaning. Since 
the definition of clarity proposed here 
agrees with definitions proposed by rhe- 
torical theorists, this lack of relationship 
would appear to be a criticism, not of 
the critics, but of the definitions of clar- 
ity which appear in traditional rhetorical 
theory. 


The “Cloze Procedure” 

The “cloze procedure” is intended as 
a measure of the “comprehensibility” 
of a message. It would be expected that 
the comprehensibility of a message 
would be a factor in its clarity, or in 
other words, the “understandability” of 
a message generated by a speaker would 
be a factor in how well that message 
would communicate a speaker’s intended 
meaning to an audience. On the other 
hand, it would not be expected that 
the comprehensibility of a message 
would determine or fix its clarity. From 
these considerations, one would predict 
that the clarity of a message and its 
comprehensibility would be _ positively 
correlated, but that the correlation 
would be fairly low. The correlation 
obtained (R = +.26) is not significant 
at the 5 per cent level, but a correlation 
this size would be obtained between un- 
related variables by chance only about 
once in ten times. It would thus appear 
that while this experiment is unable to 
establish a significant relationship be- 
tween “clarity” as here defined and 
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“comprehensibility” as defined by the 
“cloze procedure,” further exploration 
of this relationship is worthwhile. 


SUMMARY 

This study has investigated a_par- 
ticular aspect of rhetorical style, the 
aspect of clarity. Clarity is defined as 
the relationship between a speaker's in- 
tended meaning for a message and the 
audience’s obtained meaning from the 
message. The semantic differential is 
employed as an instrument to measure 
these meanings, and the D score is em- 
ployed as a measure of the extent of 
their difference. 

The results of the experiment indi- 
cated that the communicators did vary 
significantly in their ability to convey 


their intended meaning to an audience; 
that the written form of the messages 
may have been more effective, and that 
the study was unable to relate the com- 
munication of a _ speaker’s intended 
meaning to clarity as defined by rhetor- 
ical critics. 

It was also found that the ability of 
a member of an audience to obtain a 
speaker’s meaning in oral presentation 
was related to his reading score on the 
O.S.P.E. 

Finally, it was found that there was 
no apparent relationship between clar- 
ity as defined here, and comprehensibil- 
ity as defined by the “cloze procedure.” 
While the obtained correlation was posi- 
tive, it was not significantly larger than 
could be expected by chance. 
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A POSSIBLE BASIS FOR THE ASSOCIATION OF 
VOICE CHARACTERISTICS AND PERSONALITY TRAITS 


EDITH B. MALLORY and VIRGINIA R. MILLER 
Wellesley College 


I. BACKGROUND 

5 gran is a widespread tendency for 

listeners to attribute specific per- 
sonality characteristics to an individual 
on the basis of his speech. Audiences 
often derive their impressions of a 
lecturer from the sounds of his voice as 
well as from the words that he utters. 
It is customary for personnel managers 
to be influenced in their decisions by 
the voices of those they interview. In 
the informal conversations of daily life 
the majority allow their judgments of 
persons to be affected by cues of this 
sort. In all these cases there is evidently 
an implicit assumption that the voice is, 
to some degree, a reflection of the per- 
sonality. Psychologists recognize this pos- 
sibility, and include various aspects of 
speech among the “expressive move- 
ments’! which they regard as vehicles 
of personality projection. 

In experimental investigations the 
relationships between voice and person- 
ality have been most often approached 
by attempting to determine whether or 
not certain personality characteristics 
can be discerned from the total voice 
pattern. Kelly? found that personaility 
ratings by acquaintances agreed more 
closely with self-ratings than did those 
made by radio listeners who had never 
seen the speakers. Allport and Cantril? 
_ 1G. W. Allport. and P. E. Vernon, Studies 
in Expressive Movement (New York, 1933). 

2E. L. Kelly, “Personality As Revealed by 
Voice and Conversation Without Face to Face 
Acquaintance,” Psychological Bulletin, 35 
(1938), p. 710 (Abstract). 

3G. W. Allport and H. Cantril, “Judging 
Personality from Voice,” Journal of Social Psy- 
chology, 5 (1934), PP- 37-55: 


found the relationships between . the 
scores on a number of personality tests 
and estimates of personality characteris- 
tics as judged from the voice. Judgments 
based on the voice were compared with 
objective test scores for dominance by 
Eisenberg and Zalowitz,* and by Fay 
and Middleton for introversion,® soci- 
ability,6 and 
‘Types.? 


Spranger Personality 


A number of investigators have been 
concerned with the degree to which ef- 
fectiveness in speech correlates with 
specific personality characteristics. Gilk- 
inson and Knower®*-® studied the relation- 
ship between the ratings of effectiveness 
in speech and scores on the Bell Adjust- 
ment Inventory and other tests. Cheno- 
weth’® reported Bernreuter scores and 


4P. Eisenberg and E. Zalowitz, “Judging Ex- 
pressive Movement: III. Judgments of Dom- 
inance-Feeling from Phonographic Records of 
Voice,” Journal of Applied Psychology, 22 
(1938), pp. 620-631. 

5 P. J. Fay and W. C. Middleton, “Judgment 
of Introversion From the Transcribed Voice,” 
Quarterly Journal of Speech, 28 (1942), pp. 226- 
228. 

6 P. J. Fay and W. C. Middleton, “The Abil- 
ity to Judge Sociability from Voice As Trans- 
mitted Over a Public Address System,” Journal 
of Social Psychology, 13 (1941), pp. 303-309. 

7P. J. Fay and W. C. Middleton, “Judgment 
of Spranger Personality Types From the Voice 
As Transmitted Over a Public Address System,” 
Character and Personality, 8 (1939), pp. 144-155. 

8 Howard Gilkinson and Franklin H. Know- 
er, “Individual Differences Among Students of 
Speech As Revealed by Psychological Tests,” 
Quarterly Journal of Speech, 26 (1930), pp. 243- 
“J2° 

9 Howard Gilkinson and Franklin H. Know- 
er, “A Study of Standardized Personality Tests 
and Skills in Speech,” Journal of Educational 
Psychology, 32 (1941), pp. 161-175. 

10 Eugene C. Chenoweth, “The Adjustment of 
College Freshmen to the Speaking Situation,” 
Quarterly Journal of Speech, 26 (1940), pp. 585- 
588. 
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case history data of effective and non- 
effective speakers. Dow also studied 
personality test results as related to ef- 
fectiveness in public speaking. Further 
studies in this area have been made 
by Duncan,'* Eckert and 
Moore,** ?® and Murray.?® 17 


Keyes,*® 


Since even the total voice pattern is 
only an approximate indication of per- 
sonality, one cannot suppose that close 
relationships would be found between 
personality traits and partial aspects of 
speech. Sapir'® discussed the possibility 
of such relationships but offered no ex- 
perimental or quantitative evidence to 
confirm their reality. Moses, ?° also, 
analyzed various aspects of voice and 
suggested their connection with psycho- 


logical conditions. Duncan*! reported 


significant correlations of quality, pitch, 
and force with scores on the Bell In- 
ventory and the Minnesota Personality 
Scale by Darley and McNamara. She 


11 Clyde W. Dow, “The Personality Traits of 
Effective Public Speakers,” Quarterly Journal of 
Speech, 27 (1941), Pp. 525-532. 

12 Melba Hurd Duncan, “An Experimental 
Study of Some of the Relationships Between 
Voice and Personality Among Students of 
Speech,” Speech Monographs, 12 (1945), pp. 
47-60. 

13 R. G. Eckert and N. Keyes, “Public Speak- 
ing As a Cue to Personality Adjustment,” Jour- 
nal of Applied Psychology, 24 (1940), pp. 144- 
153- 
14 Wilbur E. Moore, “Personality Traits and 
Voice Quality Deficiencies,” Journal of Speech 
Disorders, 4 (1939), pp. 33-36. 

15 Wilbur E. Moore, “Factors Related to 
Achievement and Improvement in Public Speak- 
ing,” Quarterly Journal of Speech, 29 (1943), 
Pp. 213-217. 

16 Elwood Murray, The Speech Personality 
(Philadelphia, 1937), chapt. 6. 

17 Elwood Murray, “A Study of Factors Con- 
tributing to Mal-Development of the Speech 
Personality,” Speech Monographs, 3 (1936), pp. 
99-108. 

18 E. Sapir, “Speech As a Personality Trait,” 
American Journal of Sociology, 32 (1927), pp. 
892-905. 

19Paul J. Moses, “Social Adjustment and 
Voice,” Quarterly Journal of Speech, 27 (1941), 
PP- 532-537- 

20 Paul J. Moses, The Voice of Neurosis (New 
York, 1954). 

21 Duncan, op. cit. 


found only non-significant relationships 
between Bernreuter scores and students’ 
ratings of these voice factors, and so ex- 
cluded the Bernreuter results from her 
final investigation. Moore,?* however, 
used this-test as well as self-ratings in an 
investigation of students whose voices 
were breathy, harsh or metallic or had 
a nasal whine. Sanford?* 24 found indi- 
cations of personality characteristics in 
particular aspects of speech, but his 
analysis was primarily concerned with 
features such as literary style and vo- 
cabulary, i.e., at levels more complex 
than voice qualities. 

The writers of this paper are interested 
not so much in identifying speech char- 
acteristics as symptoms of personality 
conditions, but rather in the possibility 
of formulating a basic principle which 
might account for the 
which do exist. In general, investigators 
whose aim has been to find diagnostic 
clues to personal adjustment have dis- 
covered that it is easier to judge per- 
sonality from total speech patterns than 
from any of the simpler, analyzed factors 
such as. pitch or rate. Allport, for in- 
stance, says, “To attempt to correlate 
pitch with one personal quality, speech 
with another, and intensity with a third 
would be to make the whole problem 
absurdly atomistic, and, as is the case 
with all studies which seek correlations 
between mere meaningless fragments of 
well-structured personalities, the study 
would be foredoomed to failure.”25 This 
will be so if one is concerned primarily 
with individual prognoses, but in an in- 
quiry designed to discover a_ tenable 
basis for the _ relationship between 


relationships 


22 Wilbur E. Moore, “Personality Traits and 
Voice Quality Deficiencies,” pp. 33-36. 

23 F. H. Sanford, “Speech and Personality: A 
Comparative Case Study,” Character and Per- 
sonality, 10 (1942), pp. 169-198. 

24F. H. Sanford, “Speech and Personality,” 
Psychological Bulletin, 39 (1942), pp. 811-845. 

25 Allport and Cantril, op. cit., p. 39. 
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speech and personality and the pro- 
cedure which Allport rejects may actu- 
ally be indicated. A fundamental pat- 
tern of relationships which occurs with 
significantly greater than chance fre- 
quency may be of theoretical value, 
even though the correlations which are 
found prove to be far too low to war- 
rant clinical determinations. 


II. PROBLEM AND HYPOTHESIS 

Insofar as any expressive movements 
are indications of phases of personality, 
we should ask how such forms of re- 
actions may have become established. 
It is not genuinely explanatory to say 
that a particular subjective trait “makes 
us” act in a particular way, especially 
in the case of physical habits over 
which we at times exercise little delib- 
erate control. Some voice habits may be 
largely dependent on physical structure 
whereas others may have been devel- 
oped chiefly through imitation or an- 
other type of learning. It is the writers’ 
hypothesis that some vocal habits which 
are more directly associated with per- 
sonality traits were developed from 
muscular responses to the very same 
stimuli which originally induced the 
subjective states typical of those traits. 
Thus, both the motor habits and the 
personality traits related to them would 
have been established by the same se- 


quence of earlier events. 


According to this description of the 
possible connection between subjective 
traits and related expressive move- 
ments, the components of an individ. 
ual’s voice pattern would be the audi- 
ble aspects of certain physical responses 
which have become habitual in social 
situations communication 
with other persons. Some concomitance 


should therefore be 


involving 


discoverable _ be- 
tween those personality characteristics 
and those voice characteristics which 


can be conceived as parallel relations to 
particular kinds of social stimuli. 

In order to test this hypothesis, two 
personality variables were selected— 
dominance as measured by the B4D 
Scale of the Bernreuter Inventory, and 
Introversion as measured by the Bgl 
Scale of the same test. Social dominance 
may be thought of as varying quantita- 
tively, in different individuals, along a 
linear continuum which extends from 
high ascendancy to submission. The 
first extreme represents a highly pos- 
itive attitude effectiveness in 
contacts with persons. As such it leads 
to habitually outgoing tendencies to 
action, especially in those facets of be- 
havior 


toward 


which concern communication 


with the world of other individuals. 
Submission, on the contrary, is asso- 
ciated with habits of acceptance and re- 
straint. 

A series of events (or experiences) 
which would induce the subjective at- 
titude of submission might include re- 
buffs or frustrations in situations in- 
volving other persons. A parallel phys- 
ical response to such stimuli could be 
the inhibition of the actions incipient 
at the time of the event; thus, specific 
restraint of the actions normally em- 
ployed in communicating with other 
persons. 


We might therefore expect that sub- 
mission would be accompanied by a 
tendency toward the tightening of the 
vocal cords and result in giving the 
voice a higher pitch. It might also cause 
a constriction of structures of the air 
passages, producing a tendency toward 
lack of resonance. The curtailment of 
vigorous action in the speech process 
might result also in a reduction of loud- 
ness. Introversion, the habit of directing 
attention toward the self rather than 
toward the outside world, is also often 
related to difficulties in making out- 
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side personal contacts. Insofar as this 
is the case, introversion and submission 
would tend to be concomitant, and this 
is indeed a usual, but not invariable, 
finding. We might therefore anticipate 
that introversion would also tend to be 
associated with those voice characteris- 
tics which could be expected to ac- 
company submission. 

Since tension can lead to a rapid, 
jerky form of activity when compensa- 
tory effort is made, there might also be 
some likelihood of a more rapid rate in 
the speech of those who are found at 
the submissive end of the personality 
scale. In the case of introversion, how- 
ever, which may involve the habit of 
ignoring the outside world rather than 
submitting to it, there would neces- 
sarily be a reason for the compensatory 
effort resutling in rapid rate. In ac- 
cordance with this hypothesis we would 
therefore predict that dominance scores 
would be positively correlated with 
loudness, resonance, lower pitch and 
slower rate. We would predict also that 
introversion scores would correlate neg- 
atively with loudness, resonance, and 
lowness of pitch, but would not show 
any connection with rate. 
Ill. THe EXPERIMENTAL INVESTIGATION 

Procedure. The subjects of this - in- 
vestigation were all the members of an 
entering class at Wellesley College, a 
group relatively unselected in respect 
to speech and personality traits studied. 
Individual tests were given by the 
Speech Department, in which judg- 
ments were made of the voice of each 
student as she read aloud and spoke in 
a standard testing situation. Since the 
primary object of the examination was 
to identify those who might need to at- 
tend special speech classes, particular 
attention was paid to voice characteris- 
tics which might be construed as “de- 


fects”; and therefore there was no step- 
wise rating of a quality such as pitch or 
rate, but note was taken of those cases 
in which the former was especially high 
or the latter was exceptionally rapid. 
Notations were made also of other out- 
standing characteristics, both desirable 
and undesirable, in descriptive words 
such as “hard,” “resonant” and the like. 
In addition, the judges indicated those 
instances in which the manner of speak- 
ing was clearly “confident” or “self- 
conscious.” It should be made quite 
plain that these latter terms were. not 
attempts at general personality judg- 
ments, but were intended to define the 
impression made by the voice itself. 
During the first week of their stay 
at college, the Bernreuter Personality 
Inventory was administered as a group 
test to these students. It was recognized 
that no questionnaire is an adequate 
measure of personality trends, but since 
chance errors in such scores would re- 
duce, rather than enhance, the extent 
of any that might be 
found, it was felt that these scores of- 
fered conservative, as well as objective, 


relationships 


criteria for comparison. To facilitate 


statistical treatment, the Dominance 
score (B4D) and the Introversion score 
(BgI) were converted into T scores based 
on the Wellesley norms for the test. For 
these, the mean was 50 and the stand- 
ard deviation io. 

Foreign students whose English 
speech was not properly comparable to 
that of the other students were excluded 
from the study, and no use was made of 
the partial records of the few individ- 
uals who missed one or the other test. 


Complete records were available for 372 


subjects. 
Results. Biserial correlations were 
found between each of the two _ per- 


sonality measures, Dominance and In- 


troversion, and the incidence of the 
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BIsERIAL CORRELATIONS OF VOICE CHARACTERISTICS AND BERNREUTER SCORES N = 372 





























Dominance Introversion 
I or CR P r or CR P 
bis bis bis bis 

High Pitch —.15 .08 1.87 03 13 07 1.85 03 
Inadequate Loudness —.32 07 4.57 .00 14 .08 1.75 04 
Resonance 14 08 1.75 .O4 —.24 07 3-43 < 01 
Rapid Rate —.17 .08 2.12 02 .00 ~- -— —— 
Confident Manner .28 -10 2.80 < 01 —.22 07 3-14 < 01 
voice characteristics recorded. The re- 


sults found in the accompanying table 
show the degrees of positive or negative 
relationships as indicated by the biserial 
correlation (T,;.)°° With 
each coefficient is given its standard 
error (71, ;.), and, in the CR column, 


coefficients 


the value found by dividing ase by 
*r,,.- Lhe column headed P reports the 
level of confidence for each coefficient, 
i.e., the number of chances per hundred 
that the calculated relationship would 
occur merely by chance. 

High pitch was found to correlate 
with a coefficient of 
+.13 Introversion. 
These relationships are extremely slight, 


—.15 with Dom- 


inance, and with 


but obtain in the expected directions, 
at the g per cent level of confidence. In- 
adequate loudness shows, with Dom- 
inance, a correlation of —.32. This also 
is low, but since it is four and one-half 
times its standard error, it must be in- 
terpreted as very probably due to caus- 


ative factors other than chance. Inad- 


equate loudness and Introversion are 


less definitely related, with a coefficient 
of .14 at the 4 per cent level. Resonance 


26 For High Pitch, Inadequate Loudness and 
Resonance, the biserial correlations were found 
by using Formula gg, Garrett. Henry E. Gar- 
rett, Statistics in Psychology and Education 
New York, 1937), p- 369. In the case of Rate 
and Manner, judgments made possible the use 
of groups categorized at the opposite extremes 
of the scale, ie., rapid versus slow, and con- 
fident versus self-conscious. In these cases For- 
mula 206, Peters and Van Voorhis, was used. 
Charles C. Peters and Walter R. Van Voorhis, 
Statistical Procedures and Their Mathematical 
Bases (New York, 1940), p. 385. 


gives a correlation of .14 at the 4 per 
cent level with Dominance, and —.24, 
at the 1 per 
version. 


cent level, with Intro- 
It was suggested that rapid rate might 
be a concomitant of Submission and so 


show a negative relationship with Dom- 


inance scores, and this is, perhaps, 
shown by the coefficient of —.17 at the 
2 per cent level of confidence. No cor- 
relation whatever appeared between 


rapid rate and introversion. 

While the specific object of this in- 
vestigation was the testing of our hy- 
pothesis regarding specific voice vari- 
ables, it is interesting and not irrelevant 
to note also our findings regarding the 
personality measures in relation to the 
judgments of confidence versus self- 
conscious manner of speech. Confident 
manner correlated .28 with Dominance 
and —.g2 with Introversion; and both 
of these coefficients show a relationship 
at the 1 per cent level of confidence. 
While these figures were not high, they 
exceed the majority of those found in 
connection with the isolated voice qual- 
ities. This corroborates the finding of 
other experimenters that an observer's 
judgment tends to be based not on iso- 
lated aspects, but on the total pattern 
presented. It is of course possible that 
the bearing of the student, i.e., the vis- 
ible manner of standing and the facial 
expression may also have contributed 
to the verdicts. It 
acteristics of personality 


appears that char- 
reveal them- 


selves more clearly in the complex of 
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a variety of expressive movements than 
they do in the isolated aspects of voice 
production, although the latter may be 
components of the total effect. 


Discussion. All of the correlations 
found in this study are low. They offer 
no justification for predicting an in- 
dividual’s _ personality 
from isolated voice characteristics, or 
for predicting the latter from the for- 
mer. For the purposes of this study, 
however, the essential question is not 
“how close are the relationships,” but 
“do any of the expected correlations 
exist to an extent that shows they are 
due to 
chance?’ 


characteristics 


causative factors other than 


, 


While only two of the spe- 
cific voice characteristics showed cor- 
relations on which specific voice char- 
acteristics showed correlations on which 
reliance can be placed at better than the 
1 per cent level of confidence, all of 
them are found to be at the 4 per cent 
level, or better. The findings offer ap- 
preciable evidence that 
sonality and voice characteristics are 
definitely asociated, in ways which sup- 
port our hypothesis that the two types 
of responses may have been conditioned 
concomitantly by events in the individ- 


ual’s earlier life history. 


certain per- 


It may reasonably be asked why, if 
simultaneous parallel conditioning oc- 
curs at all, the association found be- 
tween the personality traits and the 
voice characteristics is so slight. Prob- 
ably the answer lies in the fact that the 
human being is an animal highly sus- 
ceptible to social or cultural influences. 
Speech is learned largely by imitation, 
and a large majority of one’s habits of 
speech may be picked up in the same 
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way. Moreover, there is considerable 
conscious control of the voice, so that 
many modifications may be made de- 
liberately in the direction of socially 
approved standards. Also, since it is 
widely understood that certain idiosyn- 
crasies will be interpreted unfavorably 
by one’s associates, a compensatory ef- 
fort to correct or mask revealing inad- 
equacies will be made,—either inten- 
tionally or unconsciously, even by the 
introverted individual. 
In view of the extensive overlay of mod- 
ifications so acquired, it is of consid- 
erable significance that the pattern of 
relationships which supports the _hy- 
pothesis of this study has been found to 
persist. 


and submissive 


IV. CONCLUSIONS 
1. There is slight but significant pos- 
itive association between the personal- 
ity trait of dominance 
characteristics of loudness, resonance, 
and lower pitch. Submission by contrast 


and the voice 


has a slight but negative association 
with the voice characteristics of loud- 
ness, resonance, and lower pitch. 


2. There is a slight positive associa- 
tion between submissiveness and rapid 
rate of speech. 

3. Introversion is negatively related 
to loudness, low pitch and resonance, 
but is unrelated to rate. 

4. The pattern of relationships found 
in this study supports the hypothesis 
that certain personality traits and cer- 
tain voice characteristics may have de- 
veloped concomitantly. as persisting 
parallel reactions to particular kinds of 
situations involving social communica- 
tion. 
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PERSONALITY AND DISCUSSION BEHAVIOR: A STUDY OF 





POSSIBLE RELATIONSHIPS 


THOMAS M. SCHEIDEL, and LAURA CROWELL 
University of Washington 


and 


JOHN R. SHEPHERD 
University of Oregon 


l* recent years a number of factor 
analytic studies have been carried 
out in the hope of isolating some es- 
sential dimensions of individual _be- 
havior in small groups. The concurrence 
among the various researchers regarding 
the identity of the important factors is 
remarkable. As summarized by Carter,’ 
the three basic dimensions are: 


Factor I: Individual Prominence and 


Achievement 
individual re- 
lated to his efforts to stand out from others and 


These are behaviors of the 
individually achieve various personal goals. 
Factor II: Aiding Attainment by the Group 

These are behaviors of the individual re- 
lated to his efforts to assist the group in achiev- 
ing goals toward which the group is oriented. 
Factor III: Sociability 

These are behaviors of the individual related 
to efforts to establish and maintain cordial and 
socially satisfying relations with other group 
members. 


As an exploratory study from which 
more intensive investigations are pro- 
jected, the research reported here was 
intended to pursue the relationships be- 
tween personality 
structures 


and need 
and effectiveness in group 
discussion. It was also hoped that this 
research would provide some workable 
hypotheses about the relations of per- 
sonality and the three dimensions of 
behavior described by Carter. 

At first glance, this may appear to be 


value 


1 See Carter’s summary of research leading 
to these factors in: Launor F. Carter, “Recording 
and Evaluating the Performance of Individuals 
as Members of Small Groups,” Personnel Psy- 
chology, VII (1954), 477-484- 





a “trait” approach to the study of leader- 
ship, an approach has been 
roundly and soundly criticized. Most 
would agree that the simple listing and 
tabulating of the traits correlated with 
leadership is a “sterile approach.” As 
Gibbs writes: 


which 


. viewed in relation to the individual, lead- 
ership is not an attribute of the personality 
but a quality of his role within a particular 
and specified social system. Viewed in relation 
to the group, leadership is a quality of its 
structure. 

Leadership is both a function of the social sit- 
uation and a function of personality, but it is 
a function of these two in interaction.2 


It is undoubtedly true that in our 
search for some consistent personality 
configurations a field interpretation is 
situ- 
ation. The importance of the situation 


needed, involving a person in a 


in producing the leader must never be 
slighted. As in Barrie’s play The Admir- 
able Crichton, the model butler’s leader- 
ship qualities come to light only after 
his whole party is marooned on a 
deserted island. But neither can those in- 
terested in discussion discount the im- 
portance of the person. In a narrowly 
defined area, such as small group prob- 
lem-solving discussion, leadership cer- 
tainly is not entirely situationally de- 
fined. If there exist some paradigms of 
personality which can be consistently re- 


2 Cecil A. Gibb, “The Principles and Traits 
of Leadership,” Journal of Abnormal and Social 
Psychology, XLVII (1947), 267-268. 
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lated to leadership behaviors, they must 
be fully investigated. A tabulation of 
leader traits is not so much needed as 
studies of the dynamics of personality in 
small groups. The necessity for such in- 
vestigation is mentioned by Gibb as he 
discusses self-assurance as a trait of lead- 
ership: 

A person who believes in himself gives the im- 
pression that he has the skill, power, or abil- 
ity which will enable him to solve the problem 
in hand. 

This is one of those personality traits which 
makes it abundantly clear that individual per- 
sonality cannot be left out of the leadership 
picture. Leadership cannot be exclusively a 
function of the situation as it is seen by an in- 
dependent observer, for individual differences 
clearly affect the social perceptions of some in- 
dividuals by others, and consequently play an 
important part in giving structure to the sit- 
uation for those who are a part of it.3 

With these considerations in mind 
this exploratory study of the personality 
needs and values related to effectiveness 
in discussion groups has been completed. 
It has been the purpose of this research 
to produce some hypotheses regarding 
the dynamics of personality as it relates 
to the observable dimensions of small 
group behavior. 


I. PROCEDURE 

Sixty-six undergraduate students from 
four sections of the basic group discus- 
sion course at the University of Wash- 
ington served as the subjects for this 
study. Each of the subjects completed 
three measurement devices during the 
quarter. These included two personality 
inventories, the Allport-Vernon-Lindzey 
Study of Values* and the Edwards Per- 
sonal Preference Schedule.’ In addition, 


8 Cecil A. Gibb, “Leadership,” Handbook of 
Social Psychology, Volume II, edited by Gard- 
ner Lindzey, (Cambridge, Mass., 1954), 886. 

4 Gordon V. Allport, Philip E. Vernon, and 
Gardner Lindzey, Study of Values, A Scale for 
Measuring the Dominant Interests in Personal- 
ity, (Boston, 1951). 

5 Allen L. Edwards, Edwards Personal Pref- 
erence Schedule, (New York, 1954). 


each student completed the Discussion 
Rating Scale prepared by the authors. 
1. The Allport-Vernon-Lindzey Study 
of Values. This test purports to measure 
personality in terms of the values that 
the individual considers important in 
life. The term ‘“‘values,’”’ as used here, 
can be equated with “interests” or “mo 
tives,” and these values are based upon 
those identified by Spranger® The six 
variables and the loci of their dominant 
interest are: 
1. Theoretical—observation, reasoning and sys- 
tematization of knowledge; 


ho 


Economic—usefulness, practicality; 
Aesthetic—gracefulness, symmetry, fitness; 
Social—sympathy, kindness, unselfishness; 


Cr & OO 


Political—personal power, influence, renown; 
6. Religious—personally satisfying relationships 
with the universe. 

In this scale the respondent is asked to 
indicate his preference for alternative 
answers to forty-five questions; on the 
basis of his responses the relative pref- 
erence for each of these values may be 
determined. 

2. The Edwards Personal Preference 
Schedule. This schedule measures fifteen 
personality variables, derived from those 
proposed as “manifest needs” by Mur- 
ray.’ Edwards tests these needs with a 
schedule containing 225 pairs of state- 
ments. The subjects are to choose that 
statement in each pair which they be- 
lieve to be the more characteristic of 
themselves. These scores provide a meas- 
ure of the following personality needs: 
1. Achievement—need to be successful, to ac- 

complish significant results; 


no 


Deference—need to accept the leadership 

of others; 

g. Order—need to obtain and maintain or- 
ganization and system; 

j. Exhibition—need to receive the notice and 

comment of others; 


6 Eduard Spranger, Types of Men, trans. by 
(Halle, Germany, 1928), 


Paul J. W. Pigors, 
109-246. 


7H. A. Murray, et. al., Explorations in Per- 


sonality, (New York, 1938), 76-85. 
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5. Autonomy—need to be free to act and to 
criticize; 

6. Affiliation—need to build and maintain at- 

tachment to others; 

Intraception—need to analyze motives and 

predict behaviors; 


~I 


8. Succorance—need to receive sympathy, en- 

couragement and favors from others; 

Dominance—need to influence and super- 

vise the actions of others; 

10. Abasement—need to feel guilt, blame and 
inferiority to others; 

11. Nurturance—need to treat others with kind- 
ness, sympathy and generosity; 


oO 


12, Change—need to experience novelty; 

13. Endurance—need to work hard, persistently 
and to the completion of the task; 

14. Heterosexuality—need to have pleasant re- 
lations with members of the opposite sex; 

15. Aggression—need to attack others and to 
avoid personal blame. 


3. The Discussion Rating Scale. This 
instrument, prepared by the authors, is 
a fifteen step rating scale which provides 
a basis for locating each subject along 
a continuum of discussion effectiveness. 
The number of scoring categories em- 
ployed in this scale was based on the 
discussion of rating scales by Guilford.® 
Each student was provided with a copy 
of the rating form on which had been 
listed the names of the class members. 
He was instructed to rate each student 
in his class on the basis of his total im- 
pression of that student’s discussion ef- 
fectiveness, having heard the student 
take part a number of times during the 
term either as member or appointed 
leader of a discussion group. He was 
reminded that the person’s effectiveness 
involved his behavior both in the log- 
ical development of ideas and in the 
maintenance of group atmosphere con- 


ducive to a high calibre of participation 
by all. 


A brief statement of the nature of 
the basic discussion course in which 


these ratings were made is pertinent 


8 J. P. Guilford, Psychometric Methods, 2nd 
Ed., (New York, 1954), 263-301. 


here. Each student participated in a 
small group discussing a topic before the 
class a half dozen times during the term, 
acting as designated leader at least once. 
Concepts of effective membership and 
leadership were evolved in the class 
through frequent and detailed evalua- 
tions of observed behavior. Responsibil- 
ities of members for voluntarily filling 
both logical and psychological needs 
arising in the group were clarified, and 
responsibility of the leader for stimulat- 
ing and focusing the activity of the 
group toward its goal was commonly 
understood. That these responsibilities 
are not deemed the singular province of 
either the designated leader or the mem- 
bers makes possible a composite judg- 
ment. 


One further step was taken in the 
preparation for this research. Prior to 
the final analysis of the data, the authors 
attempted to predict the tendency of 
the effective discussants on some of the 
key variables. Assuming personality 
needs and values to be related to the 
three dimensions of discussion behavior, 
the authors predicted certain relation- 
ships. These predictions and the results 
are included as Table III and discussed 
later in this paper. 


II. SrAtisTicAL ANALYSIS 


1. Selection of Groups. A scale value, 
or position on the continuum of dis- 
cussion effectiveness, was determined for 
each subject by applying the method of 
equal-appearing intervals to the rating 
scale data. Following a rank ordering of 
the subjects according to these values, 
those discussants who fell in the upper 
or lower 27% of their class were se- 
lected. By combining discussants thus se- 
lected from the four sections, it was pos- 
sible to form groups of 18 relatively ef- 
fective discussants and 18 relatively in- 
effective discussants. 
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The raw 
scores for each subject on both personal- 
ity scales were converted to T scores, 
with a mean of 50 and a standard devia- 
tion of 10, by referring to the college 
population norms included in the test 
manuals. The appropriate sex norms 


2. Statistical Procedures. 


were consulted in order that the group 
differences tested would not be influ- 
enced by the sex variable. To test the 
hypothesis that there are no significant 
differences between effective and inef- 
fective discussants, the standard t test 
for independent groups was applied. In 
order to test the hypothesis that this 
group does not differ from the norma- 
tive college populations, it was necessary 
to evaluate the statistical significance of 
the difference found between the means 
of the standardization which 
may be regarded as population means, 
and the means for the effective discus- 
sants. The t test was used for this pur- 
pose. 


groups, 


III. ResuLts 

The results of the analyses described 
above are summarized in Tables I, II, 
and III. Table I contains the data re- 
garding the differences between the rel- 
atively effective and relatively ineffec- 
tive discussants. Two significant differ- 
ences were found. The effective discus- 
sants were significantly lower on Affilia- 
tion and higher on Dominance than the 
ineffective discussants. It is possible to 
generalize, on the basis of Murray’s ob- 
servations and on Edwards’ list of items, 
that the individual scoring low on Af- 
filiation would not have a need to feel 
a sense of belongingness; he would not 
be especially interested in forming new 
friendships or maintaining old ones and 
would not place a high premium on so- 
cial contact in general. A person scoring 
high on Dominance could be character- 
ized as possessing a strong personal need 


to be a leader, to be recognized by a 
group as the leader, to be individually 
prominent in a group. ; 

Table I] summarizes the data regard- 
ing the differences between effective dis- 
cussants and the normative college pop- 
ulations. This comparison reveals three 
Statistically significant differences. The 
effective discussants are significantly 
higher on Dominance and lower on Suc- 
corance and Abasement than the norma- 
tive populations. The Succorance var- 
iable can be considered synonymous 
with the term “support” as used in psy- 
chological terminology. The individual 
who scores low on this variable would 
feel litthe need for increased encourage- 
ment and others. He 


would not have a primary need to have 


affection from 
others show interest, sympathy, and un- 
derstanding toward his personal prob- 
lems. He would not need the support of 
others. A low score on Abasement sug- 
gests that one has little feeling of timid- 
ity, inferiority, or self-depreciation. Of 
especial interest are two variables on 
which the effective discussants did not 
deviate significantly. On the Allport 
Social value and the Edwards Affiliation 
need, contrary to the expectations of 
statistically significant high rankings, 
this group ranked below the mean of the 
normative populations. 

In summary, these data would suggest 
that the effective discussants can be char- 
acterized by a need to be recognized as 
leader, to be independent, to be self- 
confident, and by a lack of concern for 
social contact and affiliation. These sub- 
jects evince a high value in and per- 
sonality need for self-sufficiency and per- 
sonal influence. 

Table III summarizes the results of 
the predictions made by the authors re- 
garding the relationships between the 
personality variables and the dimensions 
of discussion behavior. Such prediction 
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: TABLE I 
hy DIFFERENCES BETWEEN EFFECTIVE AND INEFFECTIVE DISCUSSANTS. 

y 

‘ Effective Ineffective Mean 

Discussants Discussants 

rd- Variable Mean N=18 Mean N=18 Difference t 
lis- Study of Values 
Op- theoretical 53-17 50.56 2.61 92 

economic 46.67 53.00 6.33 1.97 
ree aesthetic 47-33 45-50 1.83 65 
“he social 48.22 45-72 2.50 81 

political 51.56 54-17 2.61 88 
ity religious 52.33 52.06 .27 08 
uc- Personal Preference Schedule 

achievement 52.06 51.94 \ 22 04 
na- deference 50.11 54-61 4-50 1.14 
-ar- order 50.56 52.67 2.11 67 

exhibition 48.39 49-44 1.05 .25 
ous autonomy 49-33 44.83 4-50 1.36 
SV- affiliation 47-72 54-56 6.84 2.33* 

i introception 52.11 49.28 2.83 .g2 
ual succorance 44-44 49.22 4-78 1.54 
uld dominance 59-56 49-78 9.78 sar? 
we- abasement 44-11 49:83 5-72 1.54 
8 nurturance 49-78 52.17 2.39 7 
He change 48.33 50.83 2.50 78 
one endurance 52.33 48.78 3.55 1.00 

heterosexuality 48.28 46.83 1.45 45 
un- aggression 52.61 46.22 6.39 1.97 
rob- *Significant at 5% level of confidence. 
t of **Significant at 1% level of confidence. 
sug- 
nid- TABLE I was difficult because the categories of 
Of DIFFERENCES BETWEEN EFFECTIVE DISCUSSANTS personality are not unequivocal and be- 
on AND NORMATIVE COLLEGE GROUPS. eae : 
cause some of these traits overlapped the 
not Effective behavioral factors. Reference to the trait 
ort Discussants ra é : 
, Variable Mean N = 18 t definitions given earlier should help 
tion comes NIRS ————— ss s eemaiiihiais iis ala 
an Study of Values clarify for the reader the reasoning be- 
‘ theoretical 53-17 1.58 hind the predictions, although a few of 
TBS, peo a ap oe the predictions may require further ex- 
the aesthetic 47-33 1.36 z ; ] L 
social 48.22 81 planation. High aggression was not pre- 
olitical 51.56 75° - ¥ ial BS hte 
ail aioe see 4 dicted for the effective discussants be- 
8 Personal Preference cause the aggression variable in the Ed- 
har- Schedule wards test would not seem to contribute 
d as achievement 52.06 1.32 . ys : 

a deference 50.11 04 to the type of individual prominence 
self- order 50.56 23 hat dis tet hiccitiinn saniill’ i ‘d 
tee exhibition 48.39 58 that discussion theorists would consider 

autonomy 49-33 2 conducive to leadership. For the same 
sub- affiliation 47-72 1.23 ai ae “fi ; 
per- introception 52.11 96 reason it was predicted that this group 
a 44-44 2.30® would score low on the Exhibition var- 
per- dominance 59.56 —- : A ‘ 
abasement 44.11 280% lable. A high Economic value was pre- 
yen 49-78 09 dicted for it was considered that a stress 
ts of change 48.33 1.06 ai Pert 7 : 
— endurance _ 52.33 1.19 on the “practical” and the “useful 
the heterosexuality 0 ‘77 ~——-would contribute to group goal facilita- 
aggression 52.61 1.27 
sions tion behavior. 

: *Significant at 5% level of confidence. . : = - £ thi 1 

ction **Significant at 1% level of confidence. The interesting results of this anal- 
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TABLE III 
A Priori PREDICTIONS OF AUTHORS REGARDING CERTAIN PERSONALITY TRAITS 
RELATED TO THE THREE DIMENSIONS OF DISCUSSION BEHAVIOR. 








INDIVIDUAL 
PROMINANCE 


Grour GOAL 
FACILITATION 


Group 
SOCIABILITY 





high theoretical* 
high political* 
high achievement* 
low exhibition* 
high autonomy 
high dominance* 
low abasement* 
high endurance* 


high theoretical* 
high economic 
high achievement* 
high order 

high endurance* 





high social ‘ 
high affiliation 
high nurturance 








*Those predictions which seem largely to be borne out in terms of the group deviation from 


the normative mean by at least one unit. 


ysis were: 1. The relationships hypoth- 
esized between certain personality var- 
iables and “Individual Prominence” 
were found in the group of effective dis- 
cussants. 2. The relationships hypoth- 
esized between certain personality var- 
iables and “Group Goal Facilitation” 
and “Group Sociability,” with the ex- 
ception of those traits overlapping “In- 
dividual Prominence,” were not found 
in the group of effective discussants. 


IV. INTERPRETATION AND DISCUSSION 


It was the purpose of this exploratory 
study to provide hypotheses for more in- 
tensive investigations. The most provoc- 
ative hypothesis suggested by the data 
of this project can be stated as follows: 
Certain personality need and _ value 
structures can be directly related to be- 
haviors of individual prominence in 
small group discussions; such structures 
cannot be consistently related to be- 
haviors of group goal facilitation and 
group sociability for these latter factors 
involve more fully the interactional ele- 
ments of the situation. 

Some discussion of this suggested hy- 
pothesis may be helpful. It is undoubt- 
edly true that there is no unitary trait 
of leadership. Yet, in narrowly defined 
situations such as small group discus- 
sion, certain persons do seem consist- 
ently to provide most of the leadership 
behavior. When we compare these ef- 


with the 
ineffective 


fective discussants 


group 


average 

group 
member, we do find some significant per- 
sonality differences. 


member or 


The finding with 
the greatest statistical significance in 
this study is the high personality need 
for Dominance evinced by the effective 
discussants. The literature on this point 
has not been consistent. Krech and 
Crutchfield have held that ‘other things 
being equal, those persons who have in- 
sistent needs for dominance, power, and 
prestige may be expected to have higher 
potentiality for leadership.”’® On_ the 
other hand, Cattell and Stice’® have re- 
cently found no significant differentia- 
tion between leaders and non-leaders in 
terms of dominance. Gibb, in the face 
of the previous contradictory studies, 
writes, “Perhaps the best that may be 
said at the moment is that there is no 
clear evidence of the dependence of 
leadership upon dominance-need.”" It 
would seem that the results of this study 
support the view of Krech and Crutch- 
field and stand in opposition to the po- 
sition of Gibb. 

The results of this study also lead to 
the view that the effective discussant 
typically has a personality need for self- 


9D. Krech and R. S. Crutchfield, Theory 
and Problems of Social Psychology, (New York, 
1948), 437- 

10 See R. B. Cattell and G. F. Stice, The Psy- 
chodynamics of Small Groups, (Urbana, 1953) 

11 Gibb, “Leadership,” op. cit., p. 888. 
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confidence and independence. The first 
factor is not surprising, for most relevant 
studies in the literature find leaders to 
be self-confident. Regarding the second 
factor, a number of studies report the 
leader traits of initiative, persistence, 
ambition. Gibb writes, 

One is constrained to point out here that these 
findings do not, in fact, support the frequently 
heard contention that leaders are people who 
force themselves by repeated efforts into po- 
sitions of leadership. Rather, an interpretation 
more consistent with the data would be that 
this trait of “will” and “persistence” is highly 
prized by groups which, in facing problems, 
have previously met failure and rebuff.12 


Not only in groups facing opposition, 
but in many other groups as well, a 
leader exhibiting the characteristic of 
independence may find approval. 

A surprising lack of need for sociabil- 
ity was demonstrated by the experimen- 
tal group. The literature on this point, 
as on dominance, is contradictory, some 
studies showing the social trait and 
others not. The present study, like the 
research of Richardson and Hanawalt,™ 
did not demonstrate any significant need 
by the effective discussants for sociabil- 
ity. They were below the mean on all 
variables related to this factor. An ex- 
planatory hypothesis would be that ef- 
fective discussants have the “capacity” 
or “ability” for social behavior but not 
a personality “value” in or “need” for it. 

In this study of the dynamics of per- 
sonality in a small discussion group an 
attempt has been made to consider the 
possible interaction of personality and 
situation. The behavioral factor of In- 
dividual Prominence seems to have some 
basis in certain of the personality var- 
iables studied. More research designed 
to determine relevant traits and to tap 


12 Ibid., p. 887. 

13H. M. Richardson and N. G. Hanawalt, 
“Leadership as Related to the Bernreuter Per- 
sonality Measures,” Journal of Social Psychol- 
ogy, XVII (1943), 237-267. 


the relationships of these traits with 
this first dimension of behavior is 
needed. Of course, the ultimate test of 
the products of such studies will be their 
value in predicting discussion behavior. 
The behavioral factors of Group Goal 
Facilitation and Group Sociability could 
not in this study be shown to have such 
a basis in personality structure. It is hy- 
pothesized, therefore, that these dimen- 
sions of discussion behavior are products 
of certain aspects of the situation rather 
than directly of personality. Social per- 
ception and group interaction studies 
should clarify relationships among these 
factors and add to our understanding 
of this aspect of communication, 


V. SUMMARY 


This study proposed to examine cer- 
tain personality needs and values in re- 
lation to performance in small group 
discussions. Sixty-six students in basic 
discussion classes rated each other as to 
their overall effectiveness as group dis- 
cussion participants. Analysis of these 
ratings in relation to data provided by 
the Edwards Personal Preference Sched- 
ule and the Allport Study of Values sug- 


gested notable relationships between 
such personal characteristics as_ self- 
confidence, independence and dom- 


inance and the “Individual Prominence” 
dimension of discussion behavior. The 
analysis, however, revealed no such re- 
lationships of personal characteristics 
with the other two behavioral dimen- 
sions, “Group Goal Facilitation” and 
“Group Sociability.” These results sug- 
gest this interesting hypothesis: certain 
personality need and value structures 
can be directly related to behaviors of 
individual prominence in small group 
discussions but such structures cannot 
be consistently related to behaviors of 
group goal facilitation and group so- 
ciability. 
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XPERIMENTAL 

ference method 
principal difficulties: motivating sub- 
jects to feel a concern for the success of 
the discussion and measuring the quality 
of the group product. Both relate to the 
choice of subject matter for discussion 
in the experimental situation. If the 
group’s task, for example, is to solve an 
intellectual puzzle or estimate the num- 
ber of beans in a bottle, the quality of 
the group product is measurable but the 
subject may have little interest in the 
outcome. Substituting a more realistic 


research in con- 


encounters two 


task, such as formulating policy on a 


controversial public question, makes 
measurement difficult without  signi- 
ficantly improving motivation. Both 


realistically motivated discussion and 
measurable outcomes are essential to the 
experimenter’s purpose. The studies 
here reported attempt to avoid these dif- 
ficulties in an investigation of several 
hypotheses related to conference method. 


BAsic METHOD 

The group task. Discussion in the ex- 
perimental situation was of a series of 
hypothetical problems dealing with dis- 
cussion method (Experiments 1 and 2) 
and with ethics and human relations in 
industry (Experiments 3 and 4). Each 
problem, stated in a short paragraph, 
was followed by a list of five solutions, 
the group’s task being to select the best 
solution. Of the five, one, the “pre- 


*This study was supported in part by funds 
granted to the Ohio State University for aid 
in fundamental research by the Research Foun- 
dation of Ohio State University. 


ferred” solution, was known by the ex- 
perimenter to be the best in the judg- 
ment both of a panel of subject-matter 
experts and of a majority of the ap- 
proximately 100 students who had pre- 
viously discussed the problem in an 
undergraduate course in_ discussion 
method. The problems had been pre- 
validated by use as practice material in 
the course. It was thus known that at 
least 30 per cent but not more than 60 
per cent of the students in the course 
choose the preferred solutions before 
discussion and_ that 
opinion toward those sojutions. It was 
known also that the different series of 
problems used in each experiment were 
of approximately equal difficulty. 


discussion moves 


Subjects and _ procedures. Subjects 
were undergraduates enrolled in_ sec- 
tions of a course in discussion method in 
which practice discussion of problems 
similar to those used in the experiments 
was a familiar part of course routine. 
In both practice and experimental ses- 
sions the subject was given a specified 
length of time to study each problem 
and to record on a signed ballot his pre- 
discussion choice of best solution. After 
discussion of the problem for a specified 
length of time each subject recorded his 
post-discussion choice. The two sets of 
ballots from the experimental sessions 
provided the data for the studies. 

Previous to both practice and ex- 
perimental sessions subjects received the 
following advice: 

Each of you will receive three grades on your 
performance in these discussions. The first will 
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be based on the proportion of times you 
choose the preferred solution before discus- 
sion, for you are learning how to study a prob- 
lem before undertaking discussion of it; the 
second will be based on the proportion of 
times you choose the preferred solution after 
discussion, for you are learning how to profit 


from discussion; the third will be based on 


the percentage of the members of your groups 
who choose the preferred solution after discus- 
sion, for you are learning how to help the 
group reach as high a degree of consensus as 
possible on a sound solution. 


Subjects were thus motivated to take 
the experimental situation seriously, to 
exercise sound judgment both before 
and after discussion, and to give the 
group maximum assistance in choosing 
the best solution. 

Measures of the outcomes of discus- 
sion. Three outcomes of discussion were 
employed as criterion variables. The first 
was the amount of shift of opinion oc- 
curring in discussion, measured by the 
number of shifts recorded on pre- and 
post-discussion ballots from one choice 
of solution to another, regardless of 
the correctness of the choice. The sec- 
ond was _ progress consensus, 
measured without reference to correct- 
ness of choice by the Matthews Index 


toward 


of Agreement, which yields scores rang- 
ing from zero to one. The third was 
the amount of improvement in quality 
of judgment effected by discussion, the 
criterion of quality being the keyed 
best solution, as indicated above. As it 
was necessary to vary the method of 
computing this measure somewhat from 
one experiment to another, the specific 
method employed 
plained later. 


in each case is ex- 

Experimental variables. Three var- 
iables were under investigation: the 
length of discussion, varied from two to 


1 Jack Matthews and A. W. Bendig, “The 
Index of Agreement: A Possible Criterion for 
Measuring the Outcome of Group Discussion,” 
Speech Monographs, XXII (1955), 39-42. 
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13 minutes for each problem; the size of 
the group, varied from three to 12; and 
the style of moderation.? 


Three styles of moderation were em- 
ployed. Under the first no moderator 
was designated in advance, though 
nothing was done to prevent one mem- 
ber of the group from emerging spon- 
taneously as moderator, and this un- 
doubtedly happened in some discussions. 
Under the second style, “partial” moder- 
ation, a moderator was designated in ad- 
vance but restricted his activities to reg- 
ulating participation so that all mem- 
bers of the group had an equal oppor- 
tunity to speak. Under the third style, 
“full” moderator was 
instructed to facilitate group thinking, 
not only by regulating participation, but 


moderation, the 


also by making suggestions regarding 
analysis of the problem, by asking ques- 
tions to stimulate thinking, by making 
transitional and summary remarks, and 
by any other means that occurred to 
him as likely to be useful. Except as 
noted later, moderators did not vote 
before or after the discussion and were 
instructed not to express any opinion on 
the merits of any solution or of any ar- 
gument presented in discussion or to 
do anything else to influence the group 
toward selection of a particular solution. 
In no case did the moderator have ad- 
vance information regarding which was 
the preferred solution of any problem. 


With one exception to be noted later, 
all moderators were drawn from an ad- 
vanced course in conference leadership 
and had received about five weeks of 
training and experience in moderating 


2 Here and throughout this paper the terms 
moderator and moderation are preferred to 
leader and leadership to avoid the implication 
which the latter terms carry in much of the 
literature on conference method that the per- 
son designated to facilitate group thinking 
may, or should, direct group thinking toward 
a particular conclusion. 
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addition moderators 
drawn from this course had been put 
through a short period of intensive train- 
ing in moderating discussion of such 
problems as were to be used in the ex- 
periment and had been tested to insure 
that they could follow instructions re- 
garding the style of moderation to be 
employed. 


discussion. In 


Control of variables. When not under 
investigation, length of discussion, size 
of group, and style of moderation were 
either held constant or _ controlled 
through the experimental design. In ad- 
dition, except as indicated later, the fol- 
lowing variables were either held con- 
stant or controlled: length of subjects’ 
previous training and experience in 
discussion, subjects’ previous acquain- 
tance with the designated moderator, 
subjects’ previous knowledge of the 
problems discussed, length of modera- 
tors’ previous training and experience, 
the various sections of the course, each 
taught by a different instructor, from 
which subjects were drawn, the different 
successive days on which the experiment 
was conducted (not controlled in the 
last two experiments, as no practice ef- 
fect had been noted in the first two), 
the different series of problems used in 
each experiment, the competence of the 
individual moderators employed. 


Except as indicated later for Experi- 
ment 4, the subjects drawn from each 
section of the course in each experiment 
were selected randomly and for the first 
day of the experiment were assigned 
randomly to experimental groups. On 
each succeeding day the same subjects 
were reassigned to groups in such a way 
as to minimize the number of times a 
subject encountered the same classmate 
in a group and to equalize the number 
of such encounters for the various ex- 
perimental conditions. 


EXPERIMENT 1. STYLE OF MODERATION 
The purpose of the experiment was 
to determine whether two styles of mod- 
eration (no designated moderator and 
full moderation) differ significantly in 
terms of the amount of shift of opinion, 
the amount of progress toward consen- 
sus, and the amount of improvement in 
quality of judgment effected by discus- 
sion. 

Sixty subjects, 20 from each of three 
sections of the course, were organized 
into 36 groups of five, each group meet- 
ing on one or another of three successive 
days. Discussion was addressed to 39 
problems divided into three series of 13 
each, one series being discussed during 
a class period and subjects devoting ap- 
proximately one minute to reading and 
two minutes to discussing each problem. 
Eighteen groups were provided with a 
moderator instructed to employ full 
moderation, a total of six moderators 
being employed; 18 groups had no des- 
ignated moderator. 


Shift in opinion. As each of the 60 
subjects recorded a pre- and post-discus- 
sion choice of solution on each of 13 
problems daily for three days, 2340 op- 
portunities to shift in opinion occurred 
during the experiment, 1170 in unmod- 
erated and 1170 in moderated discus- 
sions. In the unmoderated discussions 
375 shifts occurred; under full modera- 
tion 30g occurred. Computation of Chi 
Square yields a value of 8.83, significant 
at less than the one per cent level of 
confidence; thus the data indicate that 
more shift occurred without a designated 
moderator than in fully moderated dis- 
cussion. 


Consensus. Since each of 36 groups 
discussed 13 problems, we may speak of 
468 discussions, 234 under each experi- 
mental condition. For each, pre- and 
post-discussion 


consensus scores 


computed. The means of the pre-discus- 


were 
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sion scores under the two experimental 
conditions (.45 and .43) did not differ 
significantly. For post-discussion scores 
the mean under full moderation was .78; 
that in unmoderated discussion was .86. 
The ¢ test yields a value of 3.48, signifi- 
cant at less than the one per cent level 
of confidence, and the conclusion may 
be drawn that more progress toward 
consensus was made without a desig- 
nated moderator than in fully mod- 
erated discussion. 


Judgment. For each of the three hours 
of participation in the experimental dis- 
cussions each subject turned in a pre- 
and a post-discussion score of soundness 
of judgment, the scores ranging from 
zero to 13, according as the subject was 
more or less successful in choosing the 
preferred solutions of the 13 problems 
discussed during the hour. The number 
of scores under each experimental con- 
dition was go. Before discussion the 
mean score in moderated groups was 
6.39; that in unmoderated groups was 
6.27. The ¢ value of .46 is not significant. 
After discussion the means were 8.10 
and 8.52 in favor of unmoderated discus- 
sion. The ¢ value of 1.51 is significant at 
between the 10 and go per cent levels 
of confidence, suggesting that more im- 
provement in quality of judgment may 
have occurred without a moderator than 
in fully moderated discussion. 


As the result is inconclusive, another 
approach to analysis of the data may be 
made. During the experiment there 
were 2340 instances of a subject partic- 
ipating in discussion of a problem, 1170 
under each experimental condition. In 
each case he could shift from a non-pre- 
ferred to the preferred solution or start 
with the preferred solution and hold to 
it (both categorized as evidence of good 
judgment); or he could shift from the 
preferred to a non-preferred solution, 
shift from one non-preferred solution to 


another, or start with a non-preferred 
solution and hold to it (all categorized 
as evidence of poor judgment). We may, 
therefore, classify the subject’s judgment 
in each instance as good or poor. Under 
full moderation there were 727 instances 
of good judgment; without a moderator 
there were 771 instances. The Chi 
Square test yields a value of 3.43, which 
is significant at approximately the seven 
per cent level of confidence. The data 
strongly suggest, therefore, that better 
judgment was displayed without a des- 
ignated moderator than in fully mod- 
erated discussion. 


STYLE OF 
MODERATION AND LENGTH OF DISCUSSION 

The purposes of the experiment were 
to determine whether two styles of mod- 
eration and three lengths of discussion 
differ significantly in terms of the 
amount of shift in opinion, the amount 
of progress toward consensus, and the 
amount of improvement in quality of 
judgment effected by discussion. 

Sixty subjects, 20 from each of three 


EXPERIMENT 2. 


sections of the course, were organized 
into 36 groups of five each. Half of the 
groups had no designated moderator; 
the other half were provided with mod- 
erators instructed to employ full mod- 
eration. Each of the six moderators was 
assigned to an equal number of discus- 
sions of the three different lengths. Dis- 
cussion was addressed to 24 problems or- 
ganized in nine series in such a way that 
three of the series included four prob- 
lems each, three included eight problems 
each, and three included twelve prob- 
lems each. Each series was discussed dur- 
ing a class period. Subjects studied each 
problem one and a half minutes, while 
the time permitted for discussion of each 
was two minutes, four and a half min- 
utes, or nine minutes. 

Shift of opinion. As each of 60 sub- 
jects recorded a pre- and post-discussion 
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choice of solution for 24 problems, 1440 
opportunities to shift occurred during 
the experiment, 720 in unmoderated 
groups and 720 in fully moderated 
groups. In the former 284 shifts oc- 
curred, in the latter 279 shifts. Since the 
Chi Square value (.046) is not signifi- 
cant, the difference in style of modera- 
tion did not affect shift of opinion. 


The experiment afforded 240 oppor- 
tunities to shift in g-minute discussions, 
480 opportunities in 41-minute discus- 
sions, and 720 opportunities in 2-minute 
discussions. In the three categories the 
percentages of subjects shifting were re- 
spectively 42, 39, and 38. Again the 
Chi Square value (1.26) is not signifi- 
cant, indicating that a hypothesis of no 
difference cannot be rejected. 


Consensus. Pre- and _ post-discussion 
consensus scores for the 288 discussions 
were computed. As expected, analysis of 
variance of the pre-discussion scores 
yielded no significant F ratios. Mean 
post-discussion consensus scores for g- 
minute, 414-minute, and 2-minute dis- 
cussions were respectively .83, .76, .86 
in fully moderated discussion, .93, .85, 
.83 in unmoderated discussion. Analysis 
of the variance of post-discussi9n scores 
is summarized in Table I. 

TABLE I 


ANALYSIS OF VARIANCE OF PoOstT-DISCUSSION 
CONSENSUS SCORES. 














Source of 

Variation df Mean Square F 
Style of 

Moderation 1 060 1.03 
Length of 

Discussion 2 095 1.64 
Interaction 2 145 2.50° 
Within groups 282 058 

Total 287 _ 








*Significant at approximately the 10% level. 


As appears from the table, neither 
style of moderation nor length of dis- 
cussion significantly affected progress 
toward consensus, though the data sug- 
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gest interaction between the two var- 
iables. 


Judgment. As the number of problems 
constituting a series for discussion dur- 
ing a class period varied, the scores of 
individual subjects for the hour’s dis- 
cussion were not comparable. The num- 
ber of members of the group choosing 
the preferred solution for any one prob- 
lem was, therefore, taken as the measure 
of quality of judgment. Groups were 
composed of five subjects each; hence 
the scores ranged from zero to five. Ac- 
cording to expectation, analysis of var- 
iance revealed no significant differences 
among pre-discussion mean scores for 
the various experimental conditions. 
The post-discussion mean scores for 9- 
minute, 4™4-minute, and 2-minute dis- 
cussions were respectively 2.29, 3.04, 3.28 
in fully moderated discussion and 2.50, 
3.15, 3.08 in unmoderated discussion. 
Analysis of the variance of these scores 
is presented in Table II. 

TABLE II 


ANALYSIS OF VARIANCE OF POsT-DISCUSSION 
SCORES OF JUDGMENT. 











Source of 

Variation df Mean Square F 
Style of 

Moderation 1 oO 

Length of 

Discussion 2 12 2.48° 
Interaction 2 1 

Within Groups 282 4.84 





Total 287 








*Significant at between the 5% and 10% levels. 


Since the analysis of variance sug- 
gests, without substantiating, the con- 
clusion that length of discussion affected 
the amount of improvement in quality 
of judgment, the data were analyzed 
also, as in the preceding experiment, in 
terms of whether the subjects’ responses 
demonstrated good or poor judgment 
during discussion. Table III presents a 
breakdown of these data, the left half 
of the table presenting the data on 
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ar- TABLE III 
QUALITY OF JUDGMENT DISPLAYED IN DISCUSSION 
ms Full No Designated g-minute 414-minute 2-minute 
Moderation Moderator Discussion Discussion Discussion 
ur- — — 
of Good 443 433 119 2096 461 
, Judgment (50%) (62%) (64% 
lis- Poor 
m Judgment 227 287 121 184 259 
ng Significance x? = -23 2 = 15.35 
sh. of Difference 50K P< .70 P< 01 
ire 
ere style of moderation alone, the right half lems arranged in four series of four 
ice || that on length of discussion alone. problems each, subjects being given two 
Ac- || It is clear from the table that while minutes to study and seven minutes to 
ar- || style of moderation did not affect qual- discuss each problem. 
ces || ity of judgment, better judgment was Shift in opinion. The experiment af- 
for | displayed in 414- and 2-minute discus- forded 480 opportunities to shift in 
ms. || sions than in g-minute discussions. opinion under full and 480 under par- 
9- tial moderation. The percentages shift- 
lis- EXPERIMENT 3. STYLE OF ing were respectively 35, and 30. The fre- 
.28 MODERATION AND SIZE OF GROUP quencies yield a Chi Square value of 
59, Two styles of moderation and four 2-52, Significant at between the 10 and 
on. | sizes of groups were tested for their dif- 2° Per cent levels of confidence. 
res || ferential effects on shift in opinion, The number of opportunities to shift 
progress toward consensus, and improve- 1 groups of different sizes were as fol- 
ment in quality of judgment. Sixty sub- lows: in groups of three members, 96; 
jects, 15 from each of four sections of 1 groups of six, 192; in groups of nine, 
— | the course, were organized into 32 288; in groups of twelve, 384. The per- 
7 groups, each group size of three, six, centages of shifts actually occurring were 
— | nine, and twelve being equally repre- Tespectively 31, 33, 32, and 32. The dif- 
sented. Half of the groups were pro- ferences obviously are not significant. 
vided with moderators instructed to pro- Table IV presents the data on shift in 
.48* . : ‘ — e . 
4 vide partial moderation, the other half opinion so arranged as to indicate wheth- 
g 
with moderators instructed to provide er style of moderation affected shift in 
a. full moderation. Eight moderators were opinion more markedly in groups of 
ne employed. Discussion was of 16 prob- one size than of another. 
ug- TABLE IV 
| INFLUENCE OF STYLE OF MODERATION AND SIZE OF GROUP ON SHIFT IN OPINION. 
‘on- || 
np | as 
; | Group 3 9 12 
lity —- — — — 
x 1 Style of 
ZEC Moderation Full Partial Full Partial Full Partial Full Partial 
in - 
a Number 
nses Shifting 17 13 27 36 52 39 70 54 
lent Number not 
— Shifting 31 35 69 60 92 105, 122 138 
enone 2— Z 2—1+ LS 2— 
half Significance x” = -44 x? = 152 x”? = 231 X = 2.69 
nin of Difference 50< P<.70 20<P<—.30 10 P< .20 10< P<.20 
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In summary, size of group did not 
affect the proportion of subjects shift- 
ing in opinion during discussion. Style 
of moderation did not significantly af- 
fect the proportion shifting in groups of 
three and six, but the data suggest that 
in groups of nine and twelve more shift 
may have occurred under full than un- 
der partial moderation. 

Consensus. Mean pre-discussion con- 
sensus scores for the six experimental 
conditions did not differ significantly. 
Mean post-discussion scores for groups 
of three, six, nine and twelve under full 
and partial moderation were _respec- 
tively .55 and .72; .48 and .62; .63 and 
453 -58 and .52. Analysis of variance of 
the post-discussion scores is summarized 
in Table V. 

TABLE V 


ANALYSIS OF VARIANCE OF POST-DISCUSSION 
CONSENSUS SCORES. 














Source of 

Variation df Mean Square F 
Style of 

Moderation 1 Ol 

Size of 

Group 3 .06 

Interaction 3 23 1.35 
Within Groups 120 17 

Total 127 








Neither style of moderation nor size 
of group significantly affected progress 
toward consensus. A special study per- 
haps should be made of the possibility, 
slightly suggested by the data, that the 
two variables may interact, partial mod- 
eration being more conducive to con- 
sensus in groups of three and six and 
full moderation in groups of nine and 
twelve. 

Judgment. As the size of group varied, 
the measure of quality of judgment was 
taken as the individual subject’s score of 
right choices of solution for a series of 
four problems, the scores thus ranging 
from zero to four. As expected, no sig- 
nificant differences appeared among the 


pre-discussion mean judgment scores for 
the different experimental conditions. 
The mean _ post-discussion judgment 
scores for groups of three, six, nine, and 
twelve under full and partial modera- 
tion were respectively 1.83 and 2.17; 2.50 
and 2.50; 2.89 and 2.50; 3.19 and 2.27. 
Analysis of variance of the post-discus- 
sion scores is summarized in Table VI. 
TABLE VI 


ANALYSIS OF VARIANCE OF PoOsT-DISCUSSION 
SCORES OF JUDGMENT. 








Source of 











Variation df Mean Square F 
Style of 

Moderation 1 12.15 ied 
Size of 

Group 3 3-79 3.83* 
Interaction 3 3.80 3.84* 
Within Groups 232 .99 

Total 239 . : 








*Significant at approximately the 1% level. 
**Significant at the .1% level. 

Analysis of the data yields several con- 
clusions of interest. First, more improve- 
ment in quality of judgment was ef- 
fected by discussion under full than un- 
der partial Second, the 
amount of improvement varied directly 
with the size of the group. Finally, the 


moderation. 


superiority of full over partial modera- 
tion in effecting improvement in quality 
of judgment varied directly with the 
size of the group: it was inferior to par- 
tial moderation in groups of three mem- 
bers but increasingly superior to par- 
tial moderation as the size of the group 
increased from six to twelve. 


EXPERIMENT 4. LENGTH OF DISCUSSION 
The purpose of the experiment was 
to test the effect on shift of opinion, 
progress toward consensus, and im- 
provement in quality of judgment of 
three lengths of time permitted for 
discussion. 
from each 
of three sections of the course, were or- 


Sixty-three subjects, 21 


ganized into 27 groups of seven mem- 
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bers each, nine groups meeting on each 
of three successive days. Each group of 
seven was composed of two subjects who 
in previous discussion in the course had 
shown better than average judgment on 
problems like those to be used in the ex- 
periment, two who had shown less than 
average judgment, and three who had 
shown about average judgment. Of the 
last three one was chosen randomly to 
serve as moderator of his group for the 
Other group, 
though acquainted with him as a class- 
mate, had never encountered him be- 
fore as a moderator. Each of the 27 


hour. members of the 


_ 


moderators was used once only. Though 
instructed to employ full moderation, 
the moderators were without previous 
training or experience in moderating 
discussion. They were permitted not 
only to vote before and after discussion 
but also to express their own opinions 
freely regarding the choice of best so- 
lution. 

Discussion was addressed to nine prob- 
lems arranged in three series of three 
problems each. Both the three series 
and the nine individual problems were 
controlled in the design of the experi- 
ment. Subjects were given two minutes 
to study each problem, but the length 
of time permitted for discussion of each 
during the hour was varied, one prob- 
lem being discussed for five minutes, 
one for nine minutes, and one for 13 
minutes. The order in which the three 
lengths of discussion were encountered 
during the hour, as well as the three 


lengths of discussion themselves, were 
controlled in the experimental design. 


Shift in opinion. As each of the 63 
subjects discussed three problems under 
each experimental condition, 189 op- 
portunities to shift occurred under each 
condition. Twenty-nine per cent of the 
subjects shifted during the 5-minute 
discussions, 33 per cent during the g- 
minute discussions, and 32 per cent dur- 
ing the 13-minute discussions. The Chi 
Square value (.72) reveals no significant 
differences among these percentages. 

Consensus. An analysis of variance of 
pre-discussion consensus scores, as an- 
ticipated, revealed no significant dif- 
ferences. Post-discussion scores were 
analyzed first by an F test and then by 
t tests for comparison of the various 
pairs of means. The results are tabulated 
in Table VII. 

As appears from the table, the data 
indicate that 13-minute discussions were 
more effective than g-minute discussions 
and probably more effective than »5- 
minute discussions in promoting con- 
sensus. 

The data included for 
each of the 81 experimental discussions 
a pre-discussion and a_post-discussion 
group score of quality of judgment 
based on the number of group members 
choosing the 


Judgment. 


solution. As 
there were seven subjects in each group, 


preferred 


the scores ranged from zero to seven. 
Analysis of variance of the pre-discussion 
scores yielded a significant F ratio. Anal- 
ysis of covariance was employed to cor- 
rect for the influence of these pre-dis- 


TABLE VII 


COMPARISON OF MEANS OF POST-DISCUSSION CONSENSUS SCORES FOR 
PAIRS OF EXPERIMENTAL CONDITIONS. 














5-min. g-min. 5-min. 13-min. g-min. 13-min. 
disc. disc. disc. disc. disc. disc. 
Means 68 58 68 82 58 82 
Significance t= os t = 1.556 t = 2.308 
of Difference 30<P<.40 10< P< .20 02 <— P< .05 
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cussion differences on 
scores. The analysis is summarized in 


Table VIII. 


post-discussion 


TABLE VIII 


ANALYSIS OF COVARIANCE OF POST-DISCUSSION 
SCORES OF JUDGMENT. 














Sum of 
Squares 
Source of | of Error of 
Variation Estimate df MeanSquare F 
Adjusted 
Means 24.72 2 12.36 3.62* 
Within 
Groups 262.67 77 3-41 
Total 287.41 79 








*Significant at between the 1% and 5% levels. 


Analysis of covariance reveals a signifi- 


cant difference among the adjusted 
means (5.72 for 5-minute discussions, 


4.82 for g-minute discussions, and 6.20 
for 13-minute discussions). Computa- 
tion of t values indicates that g-minute 
discussion effective than 13- 
minute (.01<P<.02 
probably less effective than 5-minute 
discussion (.05<P<.10) in promoting 
improvement in quality of judgment. 
The data therefore suggest that the 
three lengths of discussion, in the de- 
scending order of their effectiveness in 
promoting improvement in judgment, 


was less 


discussion and 


may be arranged as follows: 
utes, five minutes, nine minutes. This 
is identical with the order in which the 


1g min- 


three time lengths promoted progress 
toward consensus. 
SUMMARY AND DISCUSSION 

Style of moderation. The three styles 
of moderation under investigation in 
these studies may perhaps be regarded 
as points on a continuum ranging from 
no moderation through partial to full 
moderation. The conception must be 
employed with caution as the no-des- 
ignated-moderator condition does not 
preclude the possibility of a member of 
the group emerging spontaneously to 


But 
held tentatively, the view affords a pos- 
sible explanation of some of the find- 


perform a moderator’s functions. 


ings. 

In 2-minute discussions by groups of 
5 each (Experiment 1) full moderation 
apparently obstructed group process: 
more shift in opinion, more progress 
toward consensus, and probably more 
improvement in quality of judgment 
occurred without a designated mod- 
erator than under full moderation. We 
may conjecture that in so brief a period 
of time as two minutes group members 
can do little more than indicate their 
choice of solution; there is no oppor- 
tunity for the detailed presentation and 
examination of argument. It may well 
be that under these conditions majority 
influence rather than discussion is the 
important factor in group decision. This 
influence brings minority members into 
line, thus increasing the degree of con- 
sensus; and as there is evidence that the 
majority is somewhat more likely to be 
right than wrong on this type of prob- 
lem, majority influence may also im- 
prove quality of judgment. The exertion 
of such influence would be hindered by 
a moderator’s attempt to intervene, 
especially as he would be likely to in- 
sure for minority members more oppor- 
tunity to speak than they might other- 
wise have. In the longer discussions of 
Experiment 2 (nine, four and a half, 
and two minutes, or an average of about 
five minutes), the apparent superiority 
of the no-designated-moderator condi- 
tion disappears, the two styles of mod- 
eration making no significant. difference 
in the amount of shift in opinion, the 
amount of progress toward consensus, 
or the degree of improvement in quality 


of judgment. 

When we turn to the 7-minute discus- 
sions in Experiment 3, in which _ par- 
tial and full moderation were employed 





in | 
six, 
not 
mor 
was 
the 
Six 

mac 
shif 
(ni 
tior 
dat: 
con 
unc 
grol 
full 
imp 
occ 
grol 
pro 
tion 
par’ 
as t 
dat: 
poo 
mol 
mer 
tial 

fore 
has 

here 
erat 


| tial 


Si 
tion 
mus 
affec 
shift 
prog 
amo 
judg 
of t 
sum 
a cr 
crea: 
the 
be. 


por: 
her- 
; of 
ralf, 
yout 
rity 
ndi- 
nod- 
ence 

the 
1sus, 
ality 


scus- 


pa z- 


oyed 

















STUDIES OF GROUP DISCUSSION 


in groups of four different sizes (three, 
six, nine, and 12 members each), we 
note a marked tendency for one style of 
moderation to excel the other, which 
was superior depending on the size of 
the group. In small groups (three or 
six members each) style of moderation 
no difference in 
shift in opinion, whereas in large groups 


made the amount of 
(nine or 12 members each) full modera- 
tion probably effected more shift. The 
data suggest that more progress toward 
consensus was made in small groups 
under partial moderation, while in large 
groups more progress was made under 
full moderation. In ‘groups of three more 
improvement in quality of judgment 
occurred under partial moderation; in 
groups of six, nine, and 12 more im- 
provement occurred under full modera- 
tion, and the superiority of full over 
partial moderation was more apparent 
as the size of the group increased. When 
data from the four group sizes are 
pooled, it is evident that significantly 
more improvement in quality of judg- 
ment was effected by full than by par- 
tial moderation. The data suggest, there- 
fore, that when the size of the group 
has passed a certain critical point, not 
here accurately determined, full mod- 


eration is increasingly superior to par- 


tial in facilitating group process. 





Size of group. When style of modera- 
tion is ignored in Experiment 3, we 
must conclude that size of group did not 
affect either the proportion of members 
shifting in opinion or the amount of 
progress consensus. But the 
amount of improvement in quality of 
judgment varied directly with the size 
of the group. It probably may be as- 


toward 


sumed that when group size has passed 
a critical point, the value of further in- 
crease in size declines, but nothing in 
the data indicates what that point may 
be. 
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Length of discussion. In the groups 
of five in Experiment 2 different lengths 
of discussion (nine, four and a half, and 
two minutes) did not affect the amount 
either of shift in opinion or progress 
toward consensus, but better quality of 
judgment was displayed in 414- and 2- 
minute discussions than in g-minute 
discussions. In the groups of seven em- 
ployed in Experiment 4 under full 
moderation untrained moderators 
length of did not affect 


amount of shift in opinion, but 19- 


by 


discussion 


minute was more effective than g-minute 
discussion in promoting both progress 
toward consensus and improvement in 
quality of judgment, and the data sug- 
gest that in these two respects g-minute 
discussion was inferior even to 5-minute 
discussion. Both Experiments 2 and 4 
suggest, therefore, that nine minutes 
may be inferior to either a somewhat 
shorter or a somewhat longer period of 
time for discussion of the type of prob- 
lem employed in these studies. A pos- 
sible explanation is that in discussion of 
five minutes or less there is little time 
to explore the argument and majority 
influence is strong; as the majority is 
somewhat more likely to be right than 
wrong, its influence not only increases 
consensus, but results in some improve- 
ment in judgment. When nine minutes 
are available for discussion, many ar- 
guments are presented and criticized, 
majority influence wanes, thinking be- 
comes confused, and the quality of 
judgment declines. If still another four 
minutes are available, confusion is dis- 
sipated, the better argument wins over 
the poorer, and the quality of judgment 
reaches its maximum point. Presumably 
here also there is a critical point beyond 
which further prolongation of discus- 
sion does not improve judgment, but 
nothing in the evidence suggests what 
that point may be. 








EFFECTS OF CERTAIN TECHNIQUES OF CREDIBILITY 
UPON AUDIENCE ATTITUDE* 


THOMAS S. LUDLUM 
Capital University 


I. PURPOSE 

T was the purpose of this study to 
padi experimentally, the effects 
of certain techniques for increasing the 
credibility of an argumentative speech 
to an audience. Credibility was evalu- 
ated in terms of effects on the attitudes 
held by the audience toward the sub- 
ject under discussion in the speech. 


The techniques selected for study can 
be designated in Aristotelian terms as 
artistic ethical proofs (elements within 
the context of the speech itself which 
are subject to manipulation by the 
speaker). Five of these techniques were 
selected on the basis of scores which 
they received on an item rating scale 
administered to a group of 
teachers of students of 
speech. 


speech 
and a group 

The techniques thus chosen were: (1) 
acknowledging opposing arguments, (2) 
manifesting a high degree of integrity, 
(3) seeming to lead the thoughts of the 
listeners easily rather than forcing them 
alleged 
facts, 
and accompanying such facts as the 
listeners might regard with doubt, with 
materials which reveal their reliability, 


arbitrarily, (4) making sure 


facts are consistent with known 


and (5) showing that the message is as 
recent and up-to-date as possible. These 


*Based upon a doctoral dissertation com- 
pleted in 1956 at The Ohio State University. 


Professor Franklin H. Knower directed the 
study. 
1For a discussion of this procedure the 


reader is referred to the dissertation from which 
this article is abstrated, A Study for Influenc- 
ing the Credibility of a Communication, avail- 
able from University Microfilms, Ann Arbor, 
Michigan (Mic 57-1846). 


techniques were chosen because they re- 
ceived high scores as determinant fac- 
tors of credibility and because they could 
be incorporated easily into an argu- 
mentative speech. 


Il. ‘HE SPEECHES 

The speeches used in this experiment 
concerned the 1956 general election, 
then some eight months away. These 
speeches presented contentions dealing 
with three major issues. These issues 
were selected on the basis of both an- 
ticipated importance and availability of 
specific developmental materials on both 
sides, Republican and Democrat. Work- 
ing with materials furnished by the Na- 
tional Headquarters of the Republican 
and Democratic Parties the following 
questions were selected for use as issues: 
(1) Are the Republican agricultural 
policies preferable to the Democratic 
agricultural policies? (2) Are the Re- 
publican school policies preferable to 
those of the Democrats? (3) Are the Re- 
publican fiscal policies preferable — to 
those of the Democrats? 

Four straight-forward argumentative 
speeches were prepared, two of which 
were pro-Republican with one limited 
to support of party policy while the 
other concentrated on attacking the 
policies of the Democratic party. The 
other two speeches were pro-Democratic, 
again with one supporting the party and 
the other attacking the opposition 
policies. 

Four speeches in parallel form to the 
straight-forward argumentative speeches 
were prepared, the five techniques of 





cre 
lov 


eds 
De 
sen 
pul 
usi 
of 

spe 


hig 
as: 

thi 
gra 
the 
par 
was 
tho 
tio! 


pro 


ove 


of 
pla 
we! 
not 
orig 
“br 
tru 
the 
we 
4 
tha 
cal 
fine 
are 
par 
me! 
fact 
ing, 
“fac 
acce 
5. 
the 
Tht 


re- 
ac- 
ild 


7u- 
su 


ent 
on, 
IESE 
ing 
ues 
an- 

r of 
oth 
ork- 
Na- 
can 
ying 
ues: 
ural 
‘atic 
Re- 

> ~6—to 
Re- 

to 


itive 
hich 
rited 

the 

the 
The 
ratic, 
y and 
ition 


o the 
eches 
es of 





EFFECTS OF TECHNIQUES OF CREDIBILITY 


credibility being incorporated in the fol- 
lowing manner. 


1. Opposing arguments were acknowl- 
edged, using such statements as: “The 
Democrats contend that . . . this makes 
sense, but on the other hand we Re- 
publicans believe. .’ This involved 
using a brief summary in each speech 
of the arguments used in the opposition 
speeches. 


2. An attempt was made to manifest 
high integrity by the use of such phrases 
as: “I won't try to hide the fact that I 
think the ... party has the better pro- 
gram. This is why I’m speaking for 
them. But I can’t deny that the other 
party has done this.” Another example 
was: “I want to deal today only with 
those facts which my personal investiga- 
tion has revealed to to be solid.”” The 
promise, “I'll never try to pull the wool 
over your eyes,” was used twice. 

3. In an effort to lead the thoughts 
of the listeners easily, emphasis was 
placed on the fact that the issues raised 
were controversial. The “this or 
” statements often used in the 
original speeches were omitted. The 
“briefing type transitions” (e.g. This is 
true because . . .) were omitted. Finally 
the “debate type” conclusions (e.g. Thus 
we see that because . . 


nothing 


.) were omitted. 

4. An attempt was made to insure 
that the facts seemed reasonable. Typi- 
cal phrases added were: (1) “Let's con- 
fine our discussion to those facts which 
are reasonable,” “Now both 
parties have made some extreme state- 
ments, let’s omit these and deal with the 
facts.” In addition to changing the word- 
ing, an effort was made to omit those 
“facts” which seemed less likely to be 
accepted. 


and (2) 


5. An effort was made to show that 
the message was recent and up-to-date. 
Thus, the statistics and examples were 
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dated and such phrases as “the latest in- 
formation available,’ were used. 

The four speeches and their parallel 
forms had many identical factors. The 
same introduction and conclusion were 
used in all. In addition the problem 
statements, farm, school and fiscal were 
exactly the same. The arguments proper 
contained the only variation. It was here 
that the differences between attack and 
support, straight-forward argumentation 
and the parallel forms, and Republican 
and Democratic approaches occurred. 
While each 
unique, 


solution argument was 
with 


length was maintained. 


consistency respect to 


The speeches were examined by four 
teachers in the Department of Speech 
at The Ohio State University and judged 
to be valid for the purposes of identify- 
ing the types represented in this experi- 
ment. 

III. AtrrirupE SCALE 

A Likert-type attitude scale was pre- 
pared to measure audience responses to 
the speeches.? 


A split-half correlation of scores on 
the attitude scale yielded a coefficient of 
+.79, which when corrected by the 
Spearman-Brown formula became +.88. 
The obtained coefficient of correlation 
was interpreted as indicating relatively 
high reliability. Later correlations be- 
tween scores on the pre- and the post- 
test for the control group of 100 sub- 
jects who took the tests one week apart 
tended to confirm this interpretation. 
The coefficient of correlation thus ob- 
tained was +.78. The instrument was 
judged to possess face validity since the 
items came from the stimulus speeches. 

2 Rensis Likert, “A Technique for the Meas- 


urement of Attitudes,” Archives of Psychology, 
140 (1932), 1-55. 
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IV. EXPERIMENTAL SUBJECTS AND 
PROCEDURE 

All of the subjects used in the exper- 
imental phase of this study were college 
students enrolled in courses in speech. 
Most were freshmen and sophomores 
with some junior and seniors and a few 
graduate students. Of the experimental 
subjects, 348 were students at The Ohio 
State University enrolled in an elemen- 
tary course in speech, while 372 were 
students enrolled in speech courses at 
Ohio University. One hundred of the 
students from Ohio University were 
selected as control subjects. Thus, this 
this report contains data on 620 sub- 
jects serving in experimental groups and 
100 additional subjects serving as con- 
trols. Teachers at The Ohio State Uni- 
versity and Ohio University adminis- 
tered the attitude scale to these subjects. 


One week later the experimental sub- 
jects heard one of the four speeches 
Each subject was asked to record his 
name and political preference on an in- 
formation sheet. The name was used 
only to provide a key for comparison be- 
tween the pre- and post-test. The sub- 
ject’s political preference was vital in 
order to provide information on the 
basic political variable. Information was 
then presented concerning the speech. 
At The Ohio State University the sub- 
jects were told that they would hear a 
political speech by a student who was 
president of the Young Republicans or 
Young Democrats Club at Ohio Uni- 
versity, depending on which speech they 
heard. At Ohio University the subjects 
were told that they would hear a politi- 
cal speech by a graduate student at 
Ohio University who had been president 
of the Young Republicans or the Young 
Democrats when he was an undergrad- 
uate at another university. After the in- 
formation was presented, each group 
listened to one speech. As soon as the 


speech had been completed the attitude 
scales were administered. 


V. ANALYSIS OF DATA AND RESULTS 

Two statistical instruments were util- 
ized in analyzing the data. These were 
the t-test of significance, of differ- 
ence of means, and the standard error 
of measurement. 


1. The characteristics of the group 
as a whole were determined by the 
compilation of pre-test and _ post-test 
scores. The scores for the pre-test and 
the post-test were each normally distrib- 
uted, not bi-modally as one might ex- 
pect. 

There was a very definite similarity 
between the distribution on the pre-test 
and on the post-test for the whole sam- 
ple. The theoretical range for the scale 
was 40 to 200 with a theoretical mean 
of 120. On the pre-test the actual range 
was from 40 to 189 with a mean of 
124.9 and a standard deviation of 24.1. 
An the post-test the range was the same 
as the theoretical range, 40 to 200. The 
mean was 124.4 and the standard devi- 
ation was 21.0. 


2. The t-test of the significance of a 
difference between means was used first 
to determine whether the control and 
experimental groups were matched. Sec- 
ond, it was used to determine the sig- 
nificance of the shifts in attitudes asso- 
ciated with the variables studied. 


The t-ratio between the pre-test scores 
of the control and experimental groups 
was only .016, obviously not significant. 
This was construed as indicating that 
the two groups were closely matched. 

In determining the significance of the 
difference in shift associated with the 
variables studied, the technique em- 
ployed involved calculating the amount 
of shift for each individual. For the 
experimental group a movement in the 
direction of the stimulus speech was 
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considered positive, a movement in the 
contrary direction, negative. For the 
control group a change up the scale 
toward the Republican end was con- 
sidered a positive shift for those who 
had scored above the theoretical mean, 
120 on the pre-test, and a negative shift 
for those who had scored below the the- 
oretical mean. Conversely, a change 
down the scale was considered a nega- 
tive shift for those who had scored above 
the theoretical mean and a_ positive 
shift for those who had scored below 
the mean. 


There was a decided difference in the 
shift experi- 
mental and control groups. The mean 
shift for the control group was —2.2 
points as contrasted with the mean shift 
for the experimental group of +93.1 
points. The t-ratio for the difference 
was 4.61, significant at the 1 per cent 
level of confidence. Thus, there was a 
significant difference in the shift made 
by the experimental group as contrasted 
with the control group. 


occurring between the 


The difference in shift between those 
who heard the straight-forward argu- 
mentative speeches and those who heard 
the parallel forms of these speeches 
were significant at the 5 per cent level 
of confidence. The mean shift for those 
who heard the straight-forward argu- 
mentative speeches was +2.9 points. 
The mean shift for those who heard 
the speeches incorporating credibility 
techniques was only +.1 point. The ¢- 
score between those groups was 2.37 
which was significant at the 5 per cent 
level of confidence with 618 degrees of 
freedom. It must be noted that the 
straight-forward argumentative speeches 
were more effective than the parallel 
forms. It is also interesting to note that 
for listeners of the opposing party the 
argumentative speeches caused less nega- 
tive shift, —.3 point to —e.7 points 


However, the ¢-ratio of 1.91 was not 
significant at the 5 per cent level of con- 
fidence with 216 degrees of freedom. 

The most dramatic difference was be- 
tween the listeners who heard speeches 
representing the party with which they 
considered themselves affiliated and the 
listeners who heard speeches represent- 
ing the opposing party. The mean shift 
for the former was +4.9 points, for the 
latter, —1.4 points. The ¢t-ratio between 
these two group shifts was 4.92, the 
largest ¢ obtained and significant at the 
1 per cent level of confidence with 424 
degrees of freedom. 


Moreover, a comparison of the shift 
obtained for these two groups with that 
obtained for the group which declared 
no party affiliation brought to light 
some further significant differences. The 
mean shift for the group who heard 
their own party’s speeches was +4.9 
points. The mean shift for the unde- 
clared group listening to these same 
speeches was +.6 of a point. The t-ratio 
between these two group-shifts was 3.44 
which was significant at the 1 per cent 
level of confidence with 394 degrees of 
freedom. Thus, the group listening to 
their own party’s speeches not only 
shifted significantly more than those 
listening to opposition party speeches 
but also significantly more than the un- 
declared middle group. 

The mean shift reported above for 
the opposition group was —1.4 points. 
The comparison between this shift and 
the +.6 shift for the undeclared group 
resulted in a t-ratio of 2.02 which was 
significant at the 5 per cent level of 
confidence with 404 degrees of freedom. 


The difference between the amount 
of shift for those who heard attack and 
those who heard support speeches was 
not statistically significant. The over- 
all effect of the Republican speeches on 
mixed audiences was not significantly 
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different from the over-all effect of the 
Democratic speeches. The mean shifts 
of Republican and Democratic groups 
were not significantly different; nor were 
the shifts of the group who declared 
their party affiliation and the undeclared 
group. Table I presents the data which 
indicated significant differences in shift 
of attitude. Table II presents the data 
on differences in shift which were not 
statistically significant. 


vice by which one may “determine by 
just how much an individual’s score on 
the test given after hearing the argu- 
ment must differ from his score on the 
first test to amount to a significant dif- 
ference.” 

After the standard error of measure- 
ment was calculated, the computation 
of shifts in attitude was simplified by 
calculation of the significance of shifts 
of ten, twenty and thirty points re- 
































TABLE I 
Group SHIFTS WHICH WERE SIGNIFICANT. 
Number Mean t Level of 
Group of Subjects Shift ratio df Significance 
Experimental group. 620 +3.1 4.61 718 1%, 
Control group. 100 —2.2 
Heard straight-forward argumentative speech. 343 +2.9 2.37 618 5% 
Heard argumentative speech revised to 
utilize credibility techniques. 277 +0.1 
Heard speech representing own party. 208 +4.9 4.92 424 1% 
Heard speech representing opposing party. 218 —1.4 
Declared group hearing speech 
representing own party. 208 +4.9 3-44 394 1% 
Undeclared group. 188 + 6 
Declared group hearing speech 
representing opposing party. 218 —1.4 2.02 4@4 5% 
Undeclared group. 188 + & 
TABLE II 
Group SHIFTs WHICH WERE NOT SIGNIFICANT AT THE 5 PER CENT LEVEL OF CONFIDENCE. 
Number Mean t 
Group of Subjects Shift ratio df 
Heard speeches attacking opposition policies. 319 +2.0 
Heard speeches supporting party policies. 301 + 9 1.18 618 
Heard Republican speeches. 319 +1.9 
Heard Democratic speeches. 314 + & 1.3 618 
Republicans 270 +1 
Democrats 161 +1.5 .29 429 
Declared party affiliation. 43i +1.6 
Did not declare party affiliation. 188 + 6 Ql 617 








3. The standard error of measure- 
ment was used to analyze individual 
shifts in attitude. In his report of the 
effects of oral argument on changes in 
attitude, Knower recommended this de- 
vice as deserving the serious considera- 
tion of all students of attitude measure- 
ment.* He pointed out that it is a de- 

3 Franklin H. Knower, “A Study of the Ef- 


fect of Oral Argument on Changes of Attitude,” 
Journal of Social Psychology, V1 (1935), 327- 


spectively. In terms of the critical ratios 
of the standard error of measurement 
a shift of go points equaled 2.560, .... 
and was significant at the 1 per cent 
level of confidence; a shift of 20 points 


equaled 1.770, and was significant at 


neas 
the 4 per cent level of confidence; and a 


aie and 


was significant only at the 18 per cent 


shift of 10 points equaled .89o 


level of confidence. 
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Evaluated on this basis, the pattern 
of individual change in this study was 
strikingly different from that found in 
the Knower study, referred to above. 
In this study only 17 per cent of the 
experimental subjects shifted positively 
at least ten points. Only 2.1 per cent 
shifted positively at least thirty points. 
There was even less individual shift 
among control subjects. None of them 
shifted positively thirty points while 9 
per cent shifted positively as much as 
ten points. Ten per cent of the controls 
and 9.7 per cent of the experimental 
subjects shifted at least ten points nega- 
tively. 

Fairly large differences in individual 
shift occurred between those who heard 
their own party’s speeches and those who 
heard speeches of the opposing party. 
Of the former, 3.9 per cent moved posi- 
tively thirty points or more. Only 1.4 
per cent of the latter group moved this 
far. Eleven per cent of the favorable 
audience moved twenty positive points 
or more while only 4.1 per cent of the 
opposition audience did. Thirty per 
cent of the favorable audience shifted 
more than ten points positively while 
19.6 per cent of the opposed did. 

The negative shift was more dramatic. 


While 3.9 per cent of the favorable 


group shifted negatively ten points or 
more, perhaps due to regression toward 
the mean; 16 per cent of the other group 
shifted negatively. This later figure is 
more than four times the percentage of 
the former and operated in contradic- 
tion to the tendency to regress toward 
the mean. A more complete tabulation 
of the percentages shifting may be found 
in Table III. 


VI. CONCLUSIONS 


1. The results of this study again 
demonstrated that a significant change 
in attitudes can be brought about by ex- 
posing subjects to a brief oral argument. 
value of formal 
political speaking in winning converts 
to the speaker’s party from the opposing 
party may be overemphasized. The re- 
sults of the study indicate that the 
political speaker is not likely to cause 
those auditors who consider themselves 


2. However, the 


to be affiliated with the opposing party 
to shift their attitudes to a position more 
favorable to his party by the presenta- 
tion of one speech. Considered either 
as individuals or as a group their shift 
was slight and often negative. 

g. Moreover, the political speaker 
may well find that those techniques 
commonly recognized as contributing to 


TABLE III 
INDIVIDUAL SHIFT DATA FOR EXPERIMENTAL VARIABLES. 








Group 


Per Cent Shifting 








Experimental Group 

Control Group 

Heard Speech Representing Own Party 

Heard Speech Representing Opposing Party 

Heard straight argumentative speech 

Heard argumentative speech revised to 
utilize credibility techniques 

Heard attacking speech 

Heard supporting speech 

Heard Republican speeches 

Heard Democratic speeches 

Declared party preference 

Did not declare party preference 


Positive Negative 
go pts 20 pts. 10 pts. 10 pts. go pts. 
2.1 6. 17. 9.7 1.3 
: 2. Q- 10. a. 

3-9 11. 30.3 3.9 0. 
1.4 4:1 19.6 16.0 2.3 
2.3 6.9 18.8 2.6 87 
1.8 5-4 16. 12.9 1.40 
1.9 8.1 20.2 10. 63 
S. 3.9 14.8 10.5 1.60 
1.5 5.6 18. 7.6 9 
1.9 6. 16.6 11.5 1.9 
2.6 7-5 19.5 11.6 1.4 
1.3 2.7 11.7 55 1.1 
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credibility for general communication 
have little real value for him. A straight- 
forward argumentative approach seems, 
from the results of this study, to be more 
effective in political speaking. 

4. Furthermore, the political adage 
that the party in-office should “point 


with pride” while the party out-of-office 
“views with alarm” may not be valid 
advice. The results would seem to indi- 
cate that there is no real difference in 
effectiveness between speeches of attack 
on, and speeches in support of party 
policy. 
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THE RELATIONSHIP BETWEEN THE ABILITY TO SPEAK 
EFFECTIVELY AND THE PRIMARY MENTAL 
ABILITIES, VERBAL COMPREHENSION 
AND GENERAL REASONING* 


JOE M. BALL 
University of Pittsburgh 


INTRODUCTION 
HE literature in the field of speech 
is replete with studies of the re- 
lationship between measures of speak- 
ing effectiveness and various dimensions 
of the physical, emotional, and intel- 
lectual behavior of persons doing the 
speaking. Gilkinson? has published an 
excellent summary of the _ literature 
pertinent to these areas. Representative 
studies seeking to determine the _ re- 
lationship between a measure of speech 
effectiveness and general intelligence 
have indicated consistently that a low, 

positive correlation exists. 

There were two considerations which 
prompted the present study: (1) the 
need for a more precise and definitive 
description of the role which intellectual 
behavior plays in the ability to speak 
effectively and (2) the availabliity of 
“pure’’? factor tests, as represented by 
the Guilford-Zimmerman Aptitude Sur- 
vey,? which appear to offer more sensi- 
tive instrumentation for the explora- 
tion of these relationships than did the 
general intelligence tests used in earlier 
research.‘ 


* Based upon Ph.D. dissertation, The Uni- 
versity of Southern California, 1951, directed 
by Lee Edward Travis. 

1H. Gilkinson, Outlines of Research in Gen- 
eral Speech, Minneapolis, 1943. 

2The use of the term “pure factor” in 
describing a test indicates that the test is so 
constructed and designed that, within certain 
unavoidable limits, it measures one factor or 
ability and one only. 

3J. P. Guilford and Wayne S. Zimmerman, 
“The Guilford-Zimmerman Aptitude Survey.” 

4See also Guilford and Zimmerman, “The 
Guilford-Zimmerman Aptitude Survey,” Jeur- 


This study has limited itself to ascer- 
taining the relationship between speak- 
ing ability and two of the primary 
mental abilities, Verbal Comprehension 
and General Reasoning. 

PROCEDURES 

Measuring Instruments. For the pur- 
poses of this study, speaking ability has 
been defined as the rated ability of an 
individual to explain, clarify, or prove 
the point of a one point speech. 

In order to measure the ability of an 
individual in Verbal Comprehension 
and General Reasoning, parts I and II 
of the Guilford-Zimmerman Aptitude 
Survey were administered. 

This battery was selected with an eye 
for the “pure factor” nature of these 
sub-tests, a selection which was justified 
by the low order of their intercorrela- 
tion. For the subjects of this study, the 
r values were for the males .29, for the 
females .12, and for the combined sexes 
.23. These intercorrelations indicate that 
a subject did not need a high level of 
aptitude for one test in order to attain 
a high score on the other. 

The Sample. All students enrolled 
in the beginning public speaking course 
at the University of Southern California 
during the Spring Semester of 1948 were 
used as subjects in this study. The total 
number of subjects was 221 (187 males 
and 34 females). 

Ages for males and females combined 


nal of Applied Psychology, 32:24-35, Febr- 
uary, 1948, for discussion of the advantages 
in using “pure factor” tests over general intel- 
ligence tests. 
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TABLE I 
MEANS AND STANDARD DEVIATIONS FOR THE AGES AND NUMBER OF YEARS OF SCHOOLING 
OF MALES AND FEMALES, SEPARATELY AND COMBINED. 








Males (187) 
N 


Females (34) Combined (221) 











f o M oc M Pa 
Ages 23.6 1.3 20.2 4 24.1 1.1 
Years of Schooling 14.2 4 13.8 l 13.7 4 








ranged from eighteen to forty-five years, 
with a mean of 24.1, and a standard 
deviation of 1.12. The number of years 
of schooling ranged from nine to nine- 
teen, with a mean of 13.7 and a standard 
deviation of .41. Table I presents the 
distribution of ages, and the years of 
schooling for the subjects used in this 
study. 

Measurement of Verbal Comprehen- 
ston and General Reasoning. Parts I and 
II of the Guilford-Zimmerman Aptitude 
Survey were used to measure the apti- 
tude of each subject for verbal compre- 
hension (Part I) and general reasoning 
(Part II). 

Prior to taking the tests, the subjects 
were told that data gathered would be 
used by their instructors to better evalu- 
ate individual and group problems in 
connection with class work. 

Judging the Speaking Ability of Each 
Subject. The speaking effectiveness of 
each subject was determined on a basis 
of the cumulative judgments of his class- 
mates.® 

The speech used in the judgment 
situation was a three-to-five minute talk 
developing one point through the use 
of verbal forms of support. The emphasis 
in this speech was placed on the use of 
Statistics clarified by the analogy. This 


5 For discussion of the use of students in 
rating general effectiveness of speaking, see: 

A. H. Monroe, H. H. Remmers, and E. V. 
Lyle, “Measuring the Effectiveness of Public 
Speech in a Beginning Course,” Bulletin of 
Purdue University, 37, Studies in Higher Edu- 
cation, 29:27, September, 1936. 

A. I. Bryan and W. H. Wilkes, “Audience 
Tendencies in Rating Public Speakers,” Journal 
of Applied Psychology, 26:371-381, June, 1942. 


pattern was a standard class assignment 
in all twelve sections. Each subject had 
the choice of developing the main point 
of his speech along either inductive or 
deductive lines. 

The judgments of speaking effective- 
ness were expressed in terms of a whole 
number selected from a seven-point rat- 
ing scale. The scale ranged from 1- 
Among most effective to 7-Among least 
effective. 


recorded their 


evaluations of each speaker on a sepa- 


The judges 


rate ballot. The subject’s rating as a 
speaker was the mean of all judgments 
rendered. Prior to the ratings of the 
speakers, each instructor made it clear 
to his section that the judgments ren- 
dered on a speaker would have no in- 
fluence on the grades assigned by the 
instructor to each speaker. Figure 1 is 
a reproduction of the rating scale em- 
ployed in this study. 

In order to estimate the reliability of 
the speech ratings a coefficient of cor- 
relation was computed between odd and 
even speech ratings received by all 
subjects. The first twelve ratings received 
by any subject were selected as the basis 
for comparison. The resulting coefficient 
of correlation was .79, corrected by the 
Spearman-Brown formula to .82. An r 
of this size for a population of two 
hundred and twenty-one indicates a 
fairly high level of agreement among 
the student judges. 


ANALYSIS OF DATA 
I. Characteristics of Group Scores on 
the Measured and Rated Variables. 
Table II presents the means, standard 
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FIGURE I 
ef TTR PE Pree Teer rete Section ... fw aie as SE ed kee 
(1) (2) (3) (5) (6) (7) 
Among Among Among Those Among Among Among 
most more fairly midway barely less least 
effective effective effective between effective effective effective 


When the speaker has finished his speech, judge him immediately as to how well he 


EXPLAINED, PROVED, OR CLARIFIED HIS MAIN 


POINT: mark your rating with a 


whole number according to the scale above and place the rating in the upper right-hand corner. 








TABLE II 
MEANS, STANDARD DEVIATIONS AND STANDARD ERRORS OF THE MEANS OF RAW SCORES 
ON THE VERBAL COMPREHENSION AND GENERAL REASONING APTITUDE TESTS 
AND SPEECH RATINGS RECEIVED FOR MALES, FEMALES, AND 
Botu SeExEs COMBINED. 




















Aptitude Test VCx GRxx SRRxxx 

peat 7 M ef net «© ehUKMtlC lO 
Male (N-187) $5.18 13.48 .98 11.58 4-67 34 2.84 83 .06 
Female (N-34) ge o2 12.12 2 11 _? 8.14 gee 8 2.86 82 15 
Combined (N-221) oa 35.80 12.50 85 _ 4 1.02 4 70 32 2.84 84 06 








xVC—verbal comprehension 
xxGR—general reasoning 
xxxSRR—speech ratings received 


of the 
means of raw scores on the aptitude 


deviations and standard 


errors 
tests and speech ratings received for 
males, females, and the combined sexes. 

Scores on the Verbal Comprehension 
and General Tests were 
determined by the scoring formula in- 


Reasoning 


cluded in published directions for the 
administration of the Guilford-Zimmer- 
man Aptitude Survey. 

Speech ratings for each subject were 
computed by taking the average of all 
speech ratings the subject received. The 
number of such ratings varied from sub- 
ject to subject in accordance with the 
size of his particular section and the 
number of his section 
during the rating sessions. The smallest 
number of ratings received by any sub- 


absences from 


ject was twelve, the greatest number was 
twenty one, the average was fifteen. Ex- 
amination of Table II reveals a tendency 
for the distribution of speech ratings to 
cluster towards the superior end of the 
scale. Student judges in all sections 


tended to ignore inferior ratings of six 
and seven. In essence this resulted in a 
five-point rating scale instead of seven 
points with the mid-point of three in- 
stead of at four. 


II. Analysis of and 
Inter-Sex Differences on the Variables. 
Since the subjects for this study were 
drawn 


Inter-Section 


from twelve public speaking 
classes and included female as well as 
male subjects, it was necessary to evalu- 
ate the possibility of inter-section and 
inter-sex differences on the variables be- 
fore any comparison among the variables 
could be made. 


The analysis of variance technique 
described by McNemar® was used to 
evaluate differences. The 
results of this analysis indicated that 
there were no sectional 


inter-section 


differences on 
the aptitude tests scores nor on speech 
ratings. 


6 McNemar, Q., Psychological Statistics, New 
York: John Wiley and Sons, 1949, p. 236. 
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In order to determine possible male 
and female differences on the variables, 
t-ratios were computed between the dif- 
ferences of the means of the sexes for 
each variable. The only significant dif- 
ference between the sexes was found on 
the General Reasoning test scores. 

Because of the difference between the 
sexes on the General Reasoning test 
scores and because of other differences 
revealed by statistical analysis which 
lies outside the scope of this report, the 
raw scores on all rated or measured vari- 
ables were converted standard 
score scale. The stanine scale was se- 
lected because of its simplicity and ade- 
quacy of spread. 

III. Comparison between the aptitude 
test scores of the upper and lower 
twenty-seven per cent of speech ratings 
received by the The upper 
twenty-seven per cent of males on the 
speech ratings were found among stan- 
ines nine, eight, and seven. These 
stanines represented those males whose 


into a 


males. 


speaking ability was rated among those 
most effective. The 
twenty-seven per cent were found in 
stanines three, two, and one or those 
males rated among those less or least 
effective in speaking. The means of 
scores on the aptitude tests for both 
groups were then compared by comput- 
ing t-ratios between the differences of 
the means. 


more or lower 


The t-ratio for Verbal Comprehen- 
sion was found to be 3.24 and for Gen- 
eral Reasoning 3.62. With one hundred 
subjects as the basis for comparison, a 
t-ratio to be significant at the .o1 level 
had to exceed 2.63.’ 

IV. Coefficients of Correlation between 
Speech Ratings and the Appitude Test 
Scores. The stanine values were used to 


7 Guilford, J. P., Fundamental Statistics in 
Psychology and Education, New York: McGraw- 
Hill, 1956, pp. 538-39- 


compute Pearson product-moment coef- 
ficients of correlation among the vari- 
ables. Inspection of Table III reveals 
that for the male subjects of this study 
the intercorrelations of .25 and .24 
respectively indicate that a small but 
significant relationship exists between 
Speech Ratings and both Verbal Com- 
prehension and General Reasoning. 
These correlations were significant at 
the .o1 level of confidence. 

Further inspection of Table III shows 
that there was practically an absence of 
correlation between Speech Ratings for 
the females and the aptitude test scores. 
This difference in correlation for the 
males as opposed to the females could 
be due to the small number of females 
in each section and the overall small 
sample of females. It might be that fe- 
male speakers in being received well are 
dependent upon different 
skills from the males. 


factors or 


V. Predicting speech ratings when 
verbal 


reasoning tests scores are known. Pre- 


comprehension and_ general 
dicting the speech rating an individual 
would receive knowing his Verbal Com- 
prehension and General Reasoning test 
scores required the computation of a 
multiple coefficient of correlation, R, 
between the speech ratings and a com- 
bination of the scores on the aptitude 
with their inter-correlations 
considered. The multiple correlation 
technique described by Guilford*® was 
used in this computation. 

Table IV presents the results of this 
computation. Multiple coefficients of 
correlation, R, are subject to the same 
interpretation as the coefficient of cor- 
relation, r. Inspection of Table IV re- 
veals that for the males and both sexes 
combined the R of .31 was significant 
at the .o1 level of confidence, but the 
extent of the relationship would give 


variables 


8 Guilford, op. cit, pp. 401-404. 
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TABLE III 
COEFFICIENTS OF CORRELATION AND STANDARD ERRORS OF CORRELATION BETWEEN SPEECH 
RATINGS AND THE APTITUDE TEST SCORES FOR MALES, FEMALES, AND 
THE COMBINED SEXES. 








Speech Ratings 











Aptitude Tests Males Females Combined 
vc* 25 + .06 02 21 + .06 
GR** 24 + .06 02 21 + .06 








* Verbal Comprehension. 
** General Reasoning. 


r’s needed for .or level of significance 








Males (N—187) 19 

Females (N—34) 43 

Combined Sexes (N—221) 15 
TABLE IV 


MULTIPLE CORRELATIONS, STANDARD ERRORS OF CORRELATION, COEFFICIENTS OF DETERMINATION, 
AND INDICES OF FORECASTING EFFICIENCY BETWEEN SPEECH RATINGS AND A 
COMBINATION OF THE APTITUDE TESTS SCORES, WITH 
THEIR INTERCORRELATIONS CONSIDERED, FOR THE 
MALES, AND THE COMBINED SEXES. 




















r R R2 
Subjects jn VC and SRR VC and GR GR and SRR oy E 

Males (N-187) 25 .29 25 gi? 09 
07 05 

Combined (N-221) .22 .23 21 * 27 07 
06 02 








* Significant at the .o1 level. 


little assistance in 


ratings from a combination of the apti- 


predicting speech 


tude test scores. 

A coefficient of determination, R?, 
was also computed and yielded a value 
of .og which indicates that the variance 
present in the two aptitude tests is ac- 
counting for less than ten per cent of 
the variance present in the speech rat- 
ings received by the males. Further, the 
index of forecasting efficiency, E (.05), 
when expressed as a percentage shows 
that our margin of error in predicting 
speech ratings from the aptitude tests 
scores is only five per cent less than 
would be the errors of prediction with- 
out the tests. 


SUMMARY AND CONCLUSIONS 
The purpose of this study was to 
ascertain the relationship between speak- 


ing ability and two of the primary 
mental abilities, Verbal Comprehension 
and General Reasoning. The following 
conclusions were reached as a result of 
this study: 

(1) There is a small but significant re- 
lationship for the subjects of this study 
between their speaking ability and their 
Verbal and Reasoning abilities. The ex- 
tent of the relationship is too slight, 
however, to allow for accurate pre- 
The predictive indices 
established from the data are only slight 
improvements over chance predictions. 


dictive data. 


(2) When considered separately, both 
Verbal Comprehension and _ General 
Reasoning show the same degree of low, 
positive correlation with speaking 
ability for the male subjects and a 
practical absence of correlation for the 
female subjects. Neither test may be 
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said to contribute more than the other 
to speaking ability. 

(3) The low, positive correlation be- 
tween speaking ability and the aptitude 
tests for the males as opposed to the 
practical lack of correlation for the fe- 
males on these variables could have been 
due to the small number of females per 
section and the overall small sample of 
females. It might also indicate that fe- 
male speakers in being received well are 
different skills or 


dependent upon 


abilities from the males. The former is 


the more probable explanation for this 
difference. 

(4) The results of this study, when 
compared with results obtained in pre- 
vious investigations of the relationship 
between speaking ability and general 
intelligence, indicate a large measure of 
agreement in that low, positive corre- 
lations were the trend. This may be ac- 
counted for by the probability that the 
various measures of general intelligence 
used in prior investigations contained 
significant loadings of verbal and reason- 
ing factors in their construction. 
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ORAL AND NASAL SOUND PRESSURE LEVELS OF VOWELS 


RAYMOND SUMMERS 
Indiana State Board of Health 


‘HE purpose of this experiment was 
‘4 to compare oral and nasal sound 
pressures of eight vowel sounds which 
were produced at four specified sound 
pressure levels by male and female sub- 
jects. 


I. SupyecTs, APPARATUS, AND PROCEDURE 
Thirty subjects were used in the ex- 
periment, 16 males and 14 females. 
Their ages ranged from 20 to 41 years. 
Professional listeners judged the sub- 
jects as having typical, General American 
vowel production and good voice quali- 
ty (with particular emphasis placed on 
freedom from nasality). They also pos- 
sessed an ability to ._produce vowels at 
specified sound pressure levels and they 
demonstrated an ability to 
isolated speech sounds. 


prolong 


Two independent recording systems, 
one of which included a_probe-tube, 
were used to record the speech samples. 
The probe-tube was a three-inch length 
of seamless nickel tubing which was 
coupled to the diaphragm of a Western 
Electric Model 640-AA condenser micro- 
phone. The length of the probe-tube 
was partially determined by the fact 
that the coupler, condenser microphone, 
and its associated pre-amplifier (Audio 
Instrument Model 16W) 
should be placed below the mouth in 
order to minimize interference with the 
sound field. 


Company 


To quantify the oral and nasal sound 
pressures, graphic recordings of the 
speech samples were obtained using a 


*Based upon Ph.D. dissertation, Purdue Uni- 
versity, 1955, directed by M. D. Steer. 


Sound Apparatus Company High Speed 
Level Recorder (HPL-E). Also used were 
an Ampex Recorder and a Stancil-Hoff- 
man Recorder, the same instruments 
used to record the speech samples origi- 
nally. A paper speed of 10 mm/second 
was employed on the HPL-E and, using 


the maximum damping setting, the 
graphic recording tended to _ be 
smoothed. 


The vowels used in: this experiment 
were [i/ e/ &/ x/ a/ 9/ o/ uj. Although 
[e] and [o] are frequently the diph- 
thongs, [er] and [ou], the subjects at- 
tempted to produce the pure vowels. 
The four sound pressures studied were 
60, 70, 80, and go decibels (re .ooo02 
dyne/cm?) when the microphone of the 
sound level meter was eight inches from 
the subject’s mouth. 

One important aspect of this experi- 
ment involved two independent record- 
ing systems which were essentially identi- 
cal in their frequency response. These 
two systems were calibrated with 
random noise in a sound treated room. 
Since the type of calibration depends 
upon the use of the probe-tube, a free 
field calibration of the probe coupled 
to the microphone was employed. 

Two sound treated rooms, with a win- 
dow in the wall between them, were 
used for recording the vowel sounds. 
The two recorders and a technician oc- 
cupied one room and the subject and 
experimenter occupied the other. The 
subjects, upon arriving for their in- 
dividual appointments, sat in a medical 

1R. W. Benson, “The Calibration and Use 


of Probe-tube Microphones,” Jour. of Acousti- 
cal Society of America, XXV_ (1953), 128-134. 
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type examination chair in such a posi- 
tion that they could observe a sound 
level meter which indicated the sound 
pressure at which the vowels were being 
produced. The microphones of the 
sound level meter and the oral recording 
system were juxtaposed and maintained 
at a distance of eight inches from the 
mouth for all subjects. These two micro- 
phones were within 1/4th of an inch of 
each other so that both were influenced 
by approximately the same sound pres- 
sure field.? 

After a practice period, during which 
the subjects produced vowels at sound 
pressure levels of 60 to go db, the probe- 
tube was inserted into the subject’s nasal 
meatus approximately 1/4th of an inch. 

The vowels to be produced and the 
sound pressure levels to be attained 
were written on cards and were visible 
to the subjects. Both sounds and levels 
were randomly presented although all 
vowels within one sound pressure level 
were produced successively. 

In the analysis of the graphic sound 
pressure levels, the HPL-E graphs were 
aligned vertically, by time, so that 
measurements of the same portion of the 
oral and nasal recordings were made. To 
measure the same duration for each 
vowel, the experimenter marked off 
four-centimeter lengths on the HPL-E 
graphs representing four seconds of 
recording. Three factors were considered 
in locating the section to be measured. 
The section measured was not at either 
the initiation or cessation of the record- 
ing, constant sound pressure was indi- 
cated, and continuous phonation was 
manifested. 


2The “oral” sound pressure, obtained eight 
inches from the mouth, actually constituted an 
“overall” sound pressure since it was influenced 
by sound coming from both the mouth and 
nose. It will be referred to as the “oral” sound 
pressure while the sound pressure obtained 
with the probe-tube will be referred to as 
being “nasal.” 


The sound pressure measurements on 
the HPL-E graphs were made by visual 
inspection and all were verified by an 
independent examiner. To check the ex- 
perimenter’s reliability, a third person 
measured the 64 sound pressure levels of 
one subject and a coefficient of reliabil- 
ity between the experimenter and this 
third person was .9g. 


II. REsuLTs 

Oral and nasal sound pressures of 
eight vowels which were produced at 
four sound pressure levels (SPLs) by 30 
subjects were collected. The data were 
treated by an analysis of variance with 
subjects, groups, vowels, sound pressure 
levels, and locations constituting the 
main effects. All of these were treated as 
fixed variables except subjects which 
constituted a random variable. The er- 
ror term was the interaction of subjects- 
within-groups with the appropriate 
main effects and interactions. Table I 
is the analysis of variance for the total 
experiment. 

Since the experimenter was primarily 
interested in comparing relative dif- 
ferences in oral and nasal SPLs, the 
main effect of location and the first- 
order interactions involving 
were of importance.® 


location 


Location. Location refers to the nasal 
meatus (nasal location) and a position 
eight inches from the mouth (oral lo- 
cation). When averaged across groups, 
vowels, and SPLs, the mean oral sound 
pressure level was 71.4 db and the mean 
nasal sound pressure level was 104.1 db, 
a difference which is significant at the 
one per cent level. This difference, par- 
tially attributed to the fact that the oral 
SPLs were obtained eight inches from 
the mouth whereas nasals SPLs were ob- 


8Only first-order interactions are reported 
here. All possible interactions were studied 
and they are reported in the dissertation. 
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TABLE I 
ANALYSIS OF VARIANCE FOR THE MAIN EFFORTS OF 
Supjects (S), Groups (G), VOWELS (V), SOUND 
PRESSURE LEVELS (P), AND LOcATIONS (L), 
AND ALL POSSIBLE INTERACTIONS. 











Source of Mean 
Variation df Square ies 
G 1 16.04 08 
Subjects-within- 
groups 28 203.99 
Vv 7 38.85, 6.97 
VxG 7 6.30 1.13 
V x Subjects-within- 
groups 196 5-57 
y 33 45,198.91 1,230.57 
PxG 3 83.99 2.29 
P x Subjects-within- 
groups 84 36.73 
L 1 511,038.01 4,330.46 
LxG 1 1,691.77 14.34 
L x Subjects-within- 
groups 28 118.01 
¥V=xP 21 6.81 2.29 
VxPxG 21 4-74 1.60 
V x P x Subjects- 
within-groups 588 2.97 
rs i. 3  =—-1,240.39 49.85 
Px LxG 3 114.05 4-58 
P x L x Subjects- 
within-groups 84 24.88 
Vx L 7 54-19 12.46 
VxLxG ‘j 10.30 2.37 
V x L x Subjects- 
within-groups 196 4-35 
a ee Se 21 3-74 1.76 
VxLxPxG 21 3.86 1.82 
V x L x P x Subjects- 
within-groups 588 2.12 
Total 1,919 








*The critical points for tests with the follow- 
ing degrees of freedom: 


df 1% 5% 
1.28 7.64 4.20 
7,200 2.73 2.05 
3,80 4.04 2.72 
20,400 1.92 1.60 


tained well within the nasal meatus, is 
so great that a formal test of differences 
was unnecessary.* 


First-Order Interactions Involving Lo- 
cation. All two-factor interactions in- 
volving location were significant at the 
one per cent level. When averaged across 
vowels and nominal’ sound pressure 


4Due to the large difference between the 
oral and nasal sound pressures, any term in- 
volving locations may violate the assumption 
of additivity which underlies the analysis of 
variance. 


levels, the mean sound pressure levels 
for the oral location were 72.2 db and 
70.5 db for the males and females re- 
spectively and, for the nasal location, 
103.1 db and 105.2 db for the same 
respective groups. Thus, the female sub- 
jects had a lower oral sound pressure 
level and a higher nasal sound pressure 
level than the males. According to Table 
II, the mean figures for the SPL-by-loca- 
tion interaction indicate that the differ- 
ences between the nominal SPLs for the 
oral recordings are greater than the 
the SPLs for the 
nasal recordings. The vowel-by-location 
interaction (Figure 1) indicates not only 
that the nasal SPL for each vowel is 
considerably higher than the oral SPL, 
but also that when an attempt is made 


differences between 


to hold the oral sound pressure at a con- 
stant level, some factor is operating 
which makes the nasal sound pressure 
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Fic. 1—Graphic representation of the vowel-by- 
location interaction (significant at the one per 
cent level). Oral microphone was located eight 
inches from the mouth and the nasal, probe- 
tube microphone was inserted in the nasal 
meatus. 


5The actual SPLs were 58.4, 66.9, 76.0, and 
84.4 db (re .ooog dyne/cm2) and these are re- 
ferred to in this study as “nominal” sound 
pressures of 60, 70, 80, and go db respectively. 
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TABLE II 
RECORDED MEAN ORAL AND NASAL SOUND 
PRESSURE LEVELS IN DB (RE .0002 DYNE/ 
CM?) WHEN AVERAGED OVER ALL SOUNDS 
FOR EACH NOMINAL SOUND PRESSURE 
FOR THE ‘TOTAL GROUP 








Nominal Sound Sound Pressure Level 





Location Pressure Level for Total Group 
Oral 60 58.4 
Oral 70 66.9 
Oral 80 76.0 
Oral go 84.4 
Nasal 60 93-7 
Nasal 70 101.9 
Nasal 80 108.0 
Nasal go 112.7 








for the high front vowels and the high 
back vowels greater than that for the 
middle vowels, a pattern similar to the 
vowel triangle with the exception that 
the lowest SPL is shifted to [5] instead 
of being at [a].° This is graphically rep- 
resented in Figure 1. This might be at- 
tributed to the variation in the cross- 
sectional area of the mouth during the 
production of the vowels, ie., open 
mouth for low vowels and consequently 
low pressure and close mouth for high 
vowels and thus higher pressure. 


Groups. Groups consisted of male and 
female subjects who were randomly se- 
lected from a college population. When 
averaged across vowels, SPLs, and loca- 
tions, the mean SPL for the females was 
87.9 db and for the males, 87.7 db. Ac- 
cording to the analysis of variance, the 
difference between the groups is not sig- 
nificant. With a visual cue to indicate 
when the proper sound pressure is ob- 
tained, expect the mean 
sound pressures to be similar for the 
two groups. 


one would 


6In interpreting the graphic representations 
of the interactions, attention should be called 
to the parallel character or lack of parallel 
character which is represented. It should be 
noted that if the lines connecting points on 
the graphs are parallel from vowel to vowel, 
the interaction is not significant. If the lines 
are not parallel on the graphs, the interaction 
is significant. 


Vowels. 
studied 


Fight vowel sounds were 
and when averaged across 
groups, sound pressure levels, and _ lo- 
cations, the mean SPLs for these vowels 
were: [i], 88.5; [e], 88.0; [e], 87.6; [ze], 
87.7; [a], 87.6; [5], 87.1; [o], 87.5; [ul], 
88.0. The analysis of variance indicates 
that the differences among these sound 
pressures are significant at the one per 
cent level. 

Sound Pressure Levels. Each 
was produced at four specified sound 
pressure levels. When averaged across 
groups, vowels, and locations, the mean 
SPLs were 76.0, 84.4, 92.0, and 98.5 
db (re .ooog dyne/cm?*) for the 60, 70, 
80, and go db levels respectively. These 
differences are significant at the one per 
cent level. 


vowel 


The discrepancy between the 60, 70, 
80, and go db nominal sound pressure 
levels and the exact, mean sound pres- 
sure levels of 58.4, 66.9, 76.0, and 84.4 
db (re .ooog dyne/cm?*) respectively at 
which the vowels were actually produced 
is attributed to the use of random noise 
in the calibration process. It was not 
expected that the production of vowel 
sounds at these SPLs would necessarily 
be equivalent to the random noise sig- 
nals. Thus, the difference between pre- 
determined SPLs, based on speech sig- 
nals, and nominal SPLs, based on ran- 
dom noise, is understandable. 

Other interactions were 
studied also. Two of these interactions, 
vowel-by-group and SPL-by-group, are 
not significant. The vowels-by-SPL inter- 


first-order 


action (Figure 2) is significant at the one 
per cent level. 


II. 


In this experiment, 30 subjects pho- 
nated eight different vowel sounds at 
four specified sound pressure levels and 
these sounds were recorded by two inde- 
pendent recording systems, a probe-tube 


SUMMARY AND CONCLUSIONS 
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Fic. 2—Graphic representation of the vowel-by- 
sound pressure level interaction (significant at 
the one per cent level). 


microphone being used to pick up the 
sound pressure in the nasal meatus and 
a conventional, studio microphone be- 
ing used to pick up the sound pressure 
eight inches from the mouth. 


After recording the speech samples, 
graphs of the sound pressures were 
recorded on HPL-E tapes and _ subse- 
quent sound pressures for each subject 
were obtained. The statistical treatment 
of the data was by analysis of variance. 


The following conclusions can _ be 
made: 


(1) In regard to the hypotheses which 
were set out to be tested, there are no 


significant sound pressure differences 
between the two groups studied. How- 
ever, there are significant sound pressure 
differences among the eight vowels that 
were studied, among the four sound 
pressure levels that were studied, and 
between the two 
studied. 


locations that were 


(2) There are differences between and 
within oral and nasal sound pressure lev- 
els when different vowels are produced 
at different sound pressure levels by 
male and female subjects. 

(3) The differences in sound pressure, 
within the conditions under which the 
data were collected, are attributable to 
differential effects of the primary sources 
of variation, and to interactions among 
these variables. A consideration of these 
main effects and the first-order inter- 
actions resulted in the following rela- 
tionships: (a) The females have a lower 
oral SPL, across all sounds and SPLs, 
than the males. The converse is true for 
the nasal SPLs. (b) The differences 
among the oral SPLs, for both males and 
females, are greater than the differences 
among the nasal SPLs. (c) The mean 
of the four nasal SPLs for each vowel, 
when plotted, resembles the conven- 
tional vowel triangle except that the 
lowest sound pressure is for [9]. Prob- 
ably because of visual monitoring, this 
relationship does not exist for the oral 
sound pressure. (d) There are no sound 
pressure differences between male and 
female groups in producing vowel 
sounds at the four nominal SPLs. (e) 
The mean oral and nasal SPL for each 
vowel that differences 
the vowel sounds 
nominal SPLs studied. 


exist 
at the four 


indicates 
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OBSERVED AND PREDICTED ESTIMATES OF RELIABILITY 
OF ASPECTS OF A SPEECH ARTICULATION RATING SCALE 


NANCY MARCH, CARL H. WEAVER,1! SHEILA MORRISON, and JOHN W. BLACK 
The Ohio State University 


HE purpose of the study reported 

here was to determine the economies 
that might be introduced into a par- 
ticular speech-rating technique,* econ- 
omies in judges, items, and subjects. Pro- 
cedures relevant to this objective pro- 
vide an opportunity to illustrate and 
compare some of the alternative methods 
that are available to the researcher who 
is confronted with matters of reliability 
in tests that involve ratings by a panel 
of judges. 


THE SCORES 


Morrison employed a rating proce- 
dure, modeled upon the method of 
equal-appearing intervals, to obtain 
scaled samples of children’s articulation. 
The ratings ranged from one (good) to 
nine (poor) in integer units.? This de- 
velopment was patterned after Thur- 
stone’s work in scaling attitudes* and 
exploited the work of Lewis and Sher- 
man who scaled samples of stuttering in 
terms of severity. Morrison’s graded 
samples of children’s speech have been 
used in training speech therapists to 
assess children’s articulation and—of 
more relevance to the present report— 
in researches that require the training 


*Part of this study derives directly from 
a master’s thesis by Nancy March under the 
direction of John W. Black, Ohio State Univer- 
sity, 1957- 

1 Now at Central Michigan College. 

2 Sheila G. Morrison, “Measuring the Sever- 
ity of Articulation Defectiveness,” JSHD, XX 
(December, 1955), 347-351- 

8L. L. Thurstone and E. J. Chave, The 
Measurement of Attitude (Chicago, 1929). 

4D. Lewis and D. Sherman, “Measuring the 
Severity of Stuttering,” JSHD, XVI (December, 


1951), 320-326. 


of judges to evaluate the merit of ar- 
ticulation in children’s speech. 

In the present instance the samples 
that were evaluated were from conversa- 
tional speech that was elicited from forty 
children before and after a term of 
speech therapy. For the most part, the 
therapy extended over a ten-week term 
with a single three-hour session of ther- 
apy each week. The recorded material 
was edited to remove the speech of the 
interviewer, and the remaining segments 
of speech were adjoined to make one 
minute of continuous speech per sub- 
ject. This record was then segmented 
into six ten-second samples (items). The 
480 items (forty subjects, six items per 
subject for both pre- and post-therapy 
conditions) were randomized for presen- 
tation to ten judges, who were advanced 
students in speech and hearing. 

The judges were trained in a manner 
recommended by Morrison. A training 
session of about one and one-half hours 
occurred on the day preceding the ex- 
perimental judging session. Judges lis- 
tened to samples of children’s speech 
that illustrated the range of the scale, 
and were then given instructions for rat- 
ing similar segments of speech: that 
the rating should be based on articula- 
tion throughout the segment; that the 
judges should avoid the influence of 
general effectiveness insofar as possible; 
and that the scale extended from one, 
representing the least defective articula- 
tion, to nine, the most defective articula- 
tion. The judges then made practice 
judgments of thirty-six samples that 
were arranged in random order. These 
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samples were presented a second time, 
the experimenter announcing the pre- 
viously established level of severity of 
each in order that the judges might 
evaluate their practice judgments. 

The foregoing procedure involving 
thirty-six samples was repeated and fur- 
ther practice was provided with the 
presentation of twenty additional seg- 
ments. 

A further similar training session of 
about forty minutes preceded each 
subsequent experimental judging s-s- 
sion. 

Apart from the training sessions the 
judges provided 4800 responses that were 
specific to the present study, one half 
relating to the pre-therapy samples and 
one half to the post-therapy samples. 
The mean of the pre-therapy ratings 
was 5.17 + 1.58 and of the post-therapy 
ratings, 4.94 + 1.71. The difference be- 
tween the two means was statistically 
significant at the 5 per cent level of 
confidence (t, 2.05; 39 d.f.).5 For the 
purpose of testing the reliability of 
judges, of items, or of subjects, either 
set of scores would serve equally well. 
Only the pre-therapy scores were used. 


RELIABILITY OF THE JUDGES 


Four procedures were followed in 
studying the reliability of the judges. 
Procedure z involved dividing the ten 
judges into odd-even groups and de- 
termining the Pearson product-moment 
correlation coefficient between the meas- 
ures that represented the subgroups of 
judges. Similarly, eight of the judges, 
selected by chance from the ten, were 
subdivided and the correlation between 
the two sets of measures was computed. 
In like manner, six, four, and two were 


5A comparison of the two means is inci- 
dental to the present treatment. The product- 
moment correlation coefficient between the pre- 
and post-therapy scores for the forty subjects 
Was .50. 


selected and the correlation coefficient 
determined. By this procedure, the relia- 
bility coefficients of five, four, three, two, 
and one judge were found tc be re- 
spectively: .g5, .94, .go, .85, and .58, 
enumerated in the second column of 
Table I. 

The foregoing correlations were made 
the bases of prediction through a series 
of applications of the Spearman-Brown 
prophecy formula. The predictions 
made by this formula hold if the test 
items are equivalent, i.e., if they have 
the same difficulty and if the item inter- 
correlations are equal,® and may be 
used for stating the reliability of tests 
that are either longer or shorter than 
the original.t Table I enumerates in 
columns 3-72 forty-five predicted reli- 
ability coefficients that were derived by 
applying the Spearman-Brown formula 
fo reach possible N of judges to the ob- 
served values of column 2. 


Inpection of Table I suggests that a 
sampling error occurred in the instance 
of one judge vs. one judge (r = .58). The 
forty-five possible comparisons of single 
judges are enumerated in Table II. Only 
seven of the values are as low as .58. 


Thus in Procedure 7 an assumption 
was made that panels of 10, 8, 6, 4, and 
2 judges had participated. In panels of 
fewer than ten members the “members” 
of the groups were selected randomly 
from the full panel of ten members. 
Split-half correlation was used in each 
instance to yield an observed indica- 
tion of agreement between two parts of 
the sample. These five observed values, 
in turn, were made the bases for a series 
of predicted correlations for panels of 


6H. H. Remmers, “The Equivalance of 
Judgments to Test Items in the Sense of the 
Spearman-Brown Formula,” J. Ed. Psychol., 
XXII (January, 1931), 66-71. 

7 Dorothy Adkins, Construction and Anal- 
ysis of Achievement Tests (U. S. Govt. Printing 
Office, 1947). 
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TABLE I 
PREDICTED VALUES FOR r FOR VARIOUS NUMBERS OF JUDGES USING OBSERVED 
PRODUCT-MOMENT COEFFICIENTS AS THE BASES FOR PREDICTION. 








Observed r 


Predicted r 











Number of Judges 1 2 3 4 5 6 ” 8 9 10 
5 95 79 88 92 94 —- 6 6 997 97 97 
4 94 80 89 .g2 — 95 .96 .g6 97 97 .98 
3 go 75 86 — 92 % 9 9 96 96 97 
2 85 74 — 8 92 9 94 9 96 6 97 
1 58 —_ 73 81 85 87 89 gl .Q2 93 93 
TABLE II 


AN INTERCORRELATION MATRIX OF THE RATINGS OF EACH JUDGE 
WITH EACH OF NINE OTHER JUDGES. 











Judge 

Number 1 2 3 4 5 6 7 8 9 10 
1 a 81 83 80 63 67 58 74 62 ‘61 

2 — 86 82 .69 72 65 By 68 49 

3 ~= 87 74 79 75 8o 4% -72 

4 — 7 ‘74 73 ‘74 69 72 

5 a 52 ‘57 58 47 53 

6 + 79 97 89 86 

7 _ 95 85 ‘79 

8 — 83 85 

9 — 84 
10 =. 








judges ranging from one to ten mem- 
bers. 


Apparently a panel of four or five 
judges of the level of the present sample 
would be sufficient to yield a reliability 
of r=.g2 to .g5, often a satisfactorily 
high value in rating situations. It should 
be remembered that these judges were 
trained by Morrison’s method. 


Procedure 2 involved one method of 
computing average  intercorrelation. 
Since, for example, the value .7g in 
Table I was obtained by applying the 
prophecy formula to the coefficient ob- 
tained by a 5-5 division of the judging 
panel, it may be considered an estimate 
of the reliability of one random judge 
from the panel, or an estimated correla- 
tion between the ratings of any two 
randomly selected judges. This kind of 
value, more precise than the one pro- 
vided by split-half 
termed average 


correlation and 
intercorrelation, was 


computed by means of a formula offered 


by Edgerton and Toops.* Historically, 
their development obviated the need for 
either working with ranked data (Kelly 
derived a procedure for 
average intercorrelation 


determining 
from ranked 
data®) or constructing an intercorrela- 
tion matrix and averaging the series. 
Computationally the Edgerton-Toops 
procedure is cumbersome. In the present 
instance it yielded an average intercor- 
relation coefficient of .73, the same value 
that might be obtained by averaging the 
five values of Table I that pertain to 
one judge: .79, .80, .75, 
This value (.73), in turn, if extended 
by the Spearman-Brown formula would 


74, and .58. 


yield essentially the values of row 4 of 
Table I. Of especial relevance, however, 


8 Harold A. Edgerton and Herbert A. Toops, 
“A Formula for Finding the Average Inter- 
correlation Coefficient of Unranked Raw Scores 
Without Solving Any of the Individual Inter- 
correlations,” J. Ed. Psychol., XIX (February, 
1928), 131-138. 

9 Truman L. Kelley, Statistical Method (New 
York, 1923). 
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the forty-five values of Table II illus- 
trate ones that were superseded by the 
single formula for average intercorrela- 
tion, since they are coefficients of cor- 
relation of the ratings of each judge 
with those of each other judge. An 
average intercorrelation might be com- 
puted from them simply by averaging 
them. In the present instance the mean 
of these forty-five measures is .73, identi- 
cal to the value yielded by the Edgerton- 
Toops formula. 

Procedure 3 was the computation of 
the average intercorrelation coefficient 
by an analysis-of-variance technique, us- 
ing a formula developed by Horst’? and 
rewritten and extended by Ebel and 
Hoyt:" 12 


°_= 


similar and might be shifted to the right 
or left at will. The variance would be 
divided between rows and remainder. 


The entries in the present matrix were 
the individual judges’ pooled ratings 
for the six items that originated with 
one subject. This situation is hardly 
the one described in the preceding para- 
graph. Columns in the present case are 
expected to be stratified, each judge’s 
ratings being more like each other than 
like his fellows’ ratings. As disparity 
among judges (columns) increases with- 
in a particular matrix of values, the re- 
mainder or error variance decreases. For 
example, four hypothetically “similar” 
judges might lead to the following frac- 





(Variance for subjects or items) minus (remainder variance) 





(Variance for subjects or items) plus (N-1 judges [remainder variance]) 


In an instance like the present one, 
the data for the analysis are entered on 
computation sheets in the same manner 
as for the foregoing Edgerton-Toops 
procedure: rows represent persons be- 
ing judged and columns epresent 
judges. The total variance is divided 
among columns (variance for judges), 
rows (variance for experimental sub- 
jects or items), and remainder. The pro- 
cedure is an application of intraclass 
correlation. 


In intraclass correlation, typically, the 
entries in a row of the matrix are mere- 
ly samples from a common source. For 
example, the entries might be the num- 
ber of words per page of a book. In this 
example different books would be rep- 
resented in Columns 
would have little in common. Measures 


different rows. 


within a row, however, might be quite 


10 Paul Horst, “A Generalized Expression for 
the Reliability of Measures,” Psychom., XIV 
(March, 1949), 21-31. 

11 Robert L. Ebel, “Estimation of the Reli- 
ability Ratings,” Psychom., XIV (December, 
1951), 407-424. 

12 Ibid. 


tion: 


_100—% 8 15 

100 + 3 (25) 175 
Four hypothetically dissimilar judges 
might cause the small values (‘soft” 


judges) to fall in particular columns 
and the large numbers of the “hard” 
judges to fall in other columns. A larger 
proportion of the total variance would 
be attributed to judges and a smaller 
proportion to remainder. The fraction 
might become 


_100—15 _ 85 


100+ $(15) 145 
Thus, a high intraclass correlation value 
may suggest that a judge (column) is 
consistent with himself from judgment 
to judgment and that the values within 
a row may not be juxtaposed with a 
negligible effect upon the outcome. This 
characteristic of the rows of a matrix 
(equality of the values in a row) that is 
taken for granted in intraclass correla- 
tion is hardly expected in the rating cir- 
cumstance. The apparent discrepancy, 
however, between the intraclass correla- 
tion and doubly stratified scores (strati- 
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fied within columns and stratified within might be further employed in lieu of 

rows) is minimized with a rows x _ split-half correlation and the Spearman- 

columns treatment of the data so that Brown formula to estimate the reliabil- 

Guilford speaks of intraclass correlation ity of N judges if the judgments are at 

as “essentially an average intercorrela- hand. Ebel, following Hoyt, proposed 

tion.”’28 the following formula for this purpose 
The method under discussion yielded (the same numerator as above): 





—. (Variance for subjects or items) minus (remainder variance) 
rT => —s : . 
Variance for subjects or items 





a correlation coefficient for a random The estimated reliabilities for N = 
member of the judging panel of .72, two randomly selected judges from the 
essentially the same as the other ob- group of ten, three randomly selected 
tained values. Were this to be extended judges, . . . and ten judges were: two 
by the Spearman-Brown formula the judges, .83; three judges, .86; four 
results would be identical to ones al- judges, .87; five judges, .g3; six judges, 
ready obtained. (This result may be .g2; seven judges, .g4; eight judges, .g5; 
checked by applying the formula to the nine judges, .g6; and ten judges, .g6. 
data for ten judges from Table III.) These are reliabilities for panels, not 
The same formula was applied to suc- for an average member of the panel. 

cessively smaller samples of the data The foregoing material has treated 
as one judge (column) was dropped at three procedures for appraising the reli- 
random, two judges were dropped at ability of the judges who, after practic- 
random, etc. The obtained values were: ing with scaled samples, attempted to 
from the data of nine judges, .71; from assign values to additional samples of 
the data of eight judges, .72; from seven children’s articulation. The methods are 
judges, .71; from six judges, .67; five (a) split-half product-moment correla- 
judges, .73; four judges, .63; three tion, (b) Edgerton-Toops average inter- 
judges, .68; and two judges, .71. The correlation, and (c) analysis of variance. 
data from which these values were com- All of the estimates can be extended to 
puted are enumerated in Table 3. larger and smaller samples by means of 


TABLE III 
SUMMARY OF THE DISTRIBUTION OF VARIANCE AMONG THE RATINGS OF 10 JUDGES AND OF 
9, 8, 7, 6, 5, 4, 3, 2 JUDGES SELECTED RANDOMLY FROM THE 10. 
(Degrees of freedom are indicated in parentheses) 








Number of Judges 
2 3 


Source of Variation 
Judges — Columns 
32(2) 21.84(2) 13.49 (3) 9-79 (4) 10.51 (5) 3-63 (6) 761 (7) 7.52 (8) 7-25 (9) 
Subjects = Rows (39) 

4.68 8.29 10.16 15-79 14.66 16.68 22.09 22.11 25.59 
Remainder 


-78 (39) 1.18(78)  1.28(117) 1.11 (156) 1.12 (195) .94 (234)  1.01(273) .95(312) 95 (351) 


_ 
or 


6 7 8 9 10 














The procedure employed in comput- the Spearman-Brown prophecy formula. 
ing the foregoing intraclass correlations Also, the use of analysis of variance for 
and the obtained estimates of variance _ stating the reliability of an average judge 
leads to the application of a closely re- 


13]. P. Guilford, Psychometric Methods (sec. 4 cachet ” tenis aff veal 
ed., New York, 1954). lated technique, also analysis of variance, 
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for stating the reliability of a panel of 
judges. 

Finally, the Spearman-Brown formula 
may be invoked to solve for N judges 
that would be expected to yield any de- 
sired coefficient of reliability, one of the 
obtained reliabilities for the average 
judge being inserted into the formula 
as a “known” value: 


with the reliability of judges were 
applied to six, five, four, three, two, 
and one item and the Spearman-Brown 
prophecy formula applied to each as a 
basis for prediction. The observed and 
predicted values are enumerated in 
Table V. Row z of Table V includes 
the reliability of one typical item (.85). 
This value is presumably more repre- 





(Desired coefficient) (1 minus observed coefficient) 





4 


Thus, using the Edgerton-Toops 
value from above as the observed co- 
efficient and arbitrarily selecting .go as 
a satisfactory outcome, the solution for 
the data used here is: 


go (1- .73) 


N= —— 
-73 (1- .go) 


= 3.26 judges.13 


RELIABILITY OF TEST ITEMS 


The items in the present test were 
ten-second samples of speech from which 
the speech of the source of the six items 
was to be evaluated. Obviously, this 
test should be no longer than it need 
be to measure the speaker reliably. The 
number of items or speech samples re- 
quired for this score is a function of the 
reliability of the items. Two procedures 
were used to evaluate this. 

First, the split-half procedure was 
applied, the pooled responses of ten 
judges to the odd items being correlated 
with the pooled ratings of the judges to 
the even items (r, .g5). ‘Two items, 
chosen randomly for each subject, were 
then discarded and the odd-even split- 
half technique applied again. Similarly, 
four items per subject were discarded. 
These observed correlations, .95, .93, 
and .83—shown in Table 4—were in- 
serted into the Spearman-Brown formu- 
la and the reliability of one item, of two 
items, . . . of six items was predicted. 
The results are detailed in Table IV. 

Second, the analysis-of-variance pro- 
cedures described above in connection 


~ (Observed coefficient) (1 minus desired coefficient) 


sentative of the matrix than is .83 of 
Row 1 of Table IV. The latter was de- 
rived through a procedure that included 
sampling; the former was computed 
from all of the data. This observation 
pertains also to the values associated 


with two items and three items in 
Tables IV and V. 
The prediction formula that was 


used to determine the number of judges 
that would be required to maintain a 
reliability of .go was applied to deter- 
mine the number of items that would 
achieve this reliability and yielded 2.0 
items. Table V indicates the consistency 
of different items and the extent to 
which a single ten-second item reveals 
a child’s articulation. 


RELIABILITY OF SUBJECTS 

Attempts to predict by the Spearman- 
Brown prophecy formula the number of 
subjects required to yield specified re- 
sults have not been successful.1+*5 The 
split-half technique is hardly applicable 
in the present instance. The data may 
be envisaged as aligned in ten rows 
(judges) and two columns, twenty sub- 
jects pooled in each value. The out- 
come of this correlation would tell more 
about judges than about subjects. 

14 Lyle H. Lanier, “Prediction of the Reli- 
ability of Mental Tests and Tests of Special 
Abilities,” J. Exp. Psychol., X (April, 1927), 
69-113. 

15L. L. Thurstone, “A Note on the Spear- 


man-Brown Formula,” J. Exp. Psychol., XI 
(February, 1928), 62-63. 
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TABLE IV 
OBSERVED AND PREDICTED VALUES FOR r FOR VARIOUS NUMBERS OF ITEMS USING OBSERVED 
PRODUCT-MOMENT COEFFICIENTS AS THE BASES FOR PREDICTION BY THE 
SPEARMAN-BROWN FORMULA. 

















Observed r Predicted r 
Numbers 
of Items 1 2 3 4 5 6 
1 83 nad gl ‘94 ‘95 -96 ‘97 
2 -93 87 _— 95 .96 97 98 
3 ‘95 86 93 — 96 ‘97 ‘97 
TABLE V 


OBSERVED AND PREDICTED VALUES FOR YF FOR VARIOUS NUMBERS OF ITEMS. OBSERVED 
COEFFICIENTS COMPUTED BY ANALYSIS OF VARIANCE. PREDICTED VALUES 
COMPUTED BY SPEARMAN-BROWN FORMULA. 








Observed r* 


Predicted r 





Number 

of Items 1 2 3 4 5 6 
1 85 oes 2 94 .96 .93 97 
“ 93 87 - ‘95 96 ‘97 98 
3 5 84 gl — .96 .g6 .96 
4 95 84 gl ‘O4 - 96 .96 
5 -96 85 92 95 96 a 97 
6 .Q7 85 Q2 .O5 .96 .96 — 








*The observed value of r for N —1 is an average intercorrelation (r) computed by intra- 
class correlation; other observed values (r) are computed by the second analysis of variance for- 
mula cited above, the one with a single term in the denominator. 


The analysis-of-variance procedure 
that was followed above in the treat- 
ment of judges and items offers some 
possibility for determining the reliabil- 
ity of subjects. The matrix suggested 
above would be spread out over forty 
columns (subjects). First the average in- 
ter-correlation (intraclass correlation) 
would be determined (.14). Predicted 
correlations from this value for various 
numbers of subjects are presented in the 
bottom row of Table VI. The matrix 
was then reduced by five columns (sub- 
jects) and the intraclass correlation re- 
computed. This process was continued 
for randomly selected populations of 5, 
10, 15, 20, 25, 30, and 35 subjects from 
the entire matrix, and each of the ob- 
tained average intercorrelations (intra- 
class correlation) was used as a basis for 
predicting the reliability of groups of 
subjects of the six other sizes. These 
values, obtained and predicted, are pre- 
sented in Table VI. The several vari- 


ances that enter into the computation 
are presented in Table VII. 

The observed r’s in Table VI are av- 
erage intercorrelations computed by the 
Ebel analysis-of-variance technique for 
intraclass correlation. These values in- 
dicate estimates of the reliability of an 
average subject in a group of 5, 10, 15, 
20, 25, 30, 35, as well as 40 subjects. The 
low average intercorrelation of the sub- 
jects is in keeping with an individual- 
differences and a random-factor concept 
of subjects. Each value, in turn, was used 
as the basis for a series of predictions of 
the reliability of populations of other 
sizes. An illuminating comparison arises 
from (a) these predicted measures and 
(b) the estimates of reliability yielded 
by the Ebel-Hoyt analysis-of-variance 
technique for stating the reliability of 
a panel. These obtained values were: 


For five subjects 45 
For ten subjects 52 
For fifteen subjects 78 
For twenty subjects 64 
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TABLE VI 
OBSERVED AVERAGE INTERCORRELATION VALUES FOR AN AVERAGE SUBJECT OF FORTY SUBJECTS AND OF 
SEVEN GROUPS OF SUBJECTS DRAWN FROM THE FORTY AND THE PREDICTED RELIABILITY OF 
Groups OF SEVEN SIZES BASED ON EACH OBSERVED VALUE. 








Observed r 
Number of 


Predicted r 





Subjects 5 10 15 20 25 30 35 40 
5 08 _— 15 21 .26 31 34 38 41 

10 19 -10 _— 26 32 37 41 45 48 
15 19 07 13 — 16 .29 $2 35 39 
20 .08 02 O04 06 — 09 11 13 15 
25 13 03 .06 .08 as _— 15 17 19 
30 15 03 05 .08 19 17 — 18 19 
35 14 02 05 07 .08 -10 12 — 15 
40 14 02 04 .06 08 09 lI 12 _ 








TABLE VII 
SUMMARY OF THE DISTRIBUTION OF VARIANCE AMONG THE RATINGS OF 10 JUDGES RELATIVE TO 40 


SUBJECTS AND TO 5, 10, 15, 20, 


5,» 30, AND 35 SUBJECTS RANDOMLY SELECTED FROM THE 40. 


(Degrees of freedom are indicated in parentheses) 








Number of 














Subjects 5 10 15 20 25 30 35 40 
Source of Variation 
Subj. = 
Cols. 24.06 (4) 27.22(9) 21.62(14) 28.51(19) 25.01(24) 23.24(29) 24.40(34) 25.59 (39) 
Judges — 
Rows (g) 1.26 1.90 3.66 2.30 5.06 4.89 6.52 25 
Remainder  .87 (36) .g1(81) .79(126) .83(171) 1.04 (216) .76 (261) .97(306) —.95 (351) 
For twenty-five subjects ‘92 ing these sources of economy would pre- 
For thirty subjects 84 sumably exceed the tolerable limits. An 
For thirty-five subjects 85 si ‘ 
' go ” assumption was made that items and 
For forty subjects 97 


A lack of systematic growth of the meas- 
ures of reliability with larger numbers 
of subjects indicates the sensitivity of 
the procedure to sampling error. 


DISCUSSION 

An effect of this study is to highlight 
three sources of relative reliability in a 
rating procedure such as the one il- 
lustrated here, a rating of children’s 
articulation. This reliability, in turn, 
may be computed and stated in several 
ways, equally appropriately. 

An object in the present instance was 
to ferret out feasible economies in the 
testing procedure that was under study. 
With ten judges, the six items could 
have been reduced by two thirds, or 
with six items the judges could have 
been reduced by two thirds and a re- 
liability of .go maintained. Compound- 


judges were equal in affecting the reli- 
ability of the test and that a reliability 
of .go might be maintained by reducing 
both by one third (six judges and four 
items). Accordingly, the total matrix was 
reduced through random selection to 
these proportions and two split-half cor- 
relations were determined, one based on 
three versus three judges and the other 
on two versus two items, The former 
yielded a coefficient of .87 and the latter 
.g1, indicating that no further economiz- 
ing was in order if a reliability coefficient 
of .go was to be claimed and tending to 
confirm that items and judges were ap- 
proximately equally contributory to test 
reliability. 
CONCLUSIONS 

1. The kind of test material employed 
in appraising children’s articulation 
yields judgmental data that fall within 
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the range of application of the Spear- 
man-Brown prophecy formula in terms 
of test item and judge reliability, but 
less so with respect to subjects. 

2. Equivalent statements of reliability 
may be obtained, in instances like the 
present one, from comparing parts (e.g. 
halves) of a test or from computing an 
average intercorrelation (or by averag- 
ing N (N-1)/2 individual correlations). 
The average intercorrelation can be 
computed by a Z transformation or by 
analysis of variance. Similarly there may 
be alternative procedures for stating the 
reliability of a population. This value 
may then be used to predict the reli- 
ability of a subsample of the population. 


SUMMARY 


Data from an articulation rating test 
were evaluated with respect to reliabil- 
ity. The procedures that were employed 
included (a) split-half and other seg- 


mentings of the measures from ten 
judges, forty subjects, and six items per 
subject (the entire set of measures was 
also replicated); (b) average intercorrela- 
tion of the judges by means of a Z-score 
transformation of the data; and (c) av- 
erage intercorrelation by means of anal- 
ysis of variance (intraclass correlation). 
Interpretation of the obtained values 
was facilitated by an extensive use of 
the Spearman-Brown prophecy formula 
applied to measures obtained by the 
three foregoing procedures. These val- 
ues, in turn, were In some instances com- 
pared with measures of reliability that 
were computed by analysis of variance 
and intended to state the reliability of 
a “group,” not an average individual. 

The Spearman-Brown formula, known 
to predict when items are equivalent, 
was applicable in extending measures 
obtained from judges and items, not 
subjects. 
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